DNA POLYMERASES WITH ENHANCED
LENGTH OF PRIMER EXTENSION

CROSS REFERENCE TO RELATED APPLICATION 

This application is a continuation-in-part of co-pending
application Serial No. 08/483,535 filed June 7, 1995, which
is a continuation-in-part of Patent No. 5,436,149, filed
February 19, 1993.

BACKGROUND OF THE INVENTION 

The present invention is directed to DNA polymerases, and
more particularly, to a novel formulation of DNA
polymerases, which formulation of enzymes is capable of
efficiently catalyzing the amplification by PCR (the
polymerase chain reaction) of unusually long and faithful
products.

DNA polymerase obtained from the hot springs bacterium
Thermus aquaticus (Taq DNA polymerase) has been
demonstrated to be quite useful in amplification of DNA,
in DNA sequencing, and in related DNA primer extension
techniques because it is thermostable. Thermostable is
defined herein as having the ability to withstand
temperatures up to 95°C for many minutes without becoming
irreversibly denatured, and the ability to polymerize DNA
at high temperatures (60° to 75°C). The DNA and amino
acid sequences described by Lawyer et al., J. Biol. Chem.
264:6427 (1989), GenBank Accession No. J04639, define the
gene encoding Thermus aquaticus DNA polymerase and the
enzyme Thermus aquaticus DNA polymerase as those terms
are used in this application. The highly similar DNA
polymerase (Tfl DNA polymerase) expressed by the closely
related bacterium Thermus flavus is defined by the DNA
and amino acid sequences described by Akhmetzjanov, A.A.,
and Vakhitov, V.A. (1992) Nucleic Acids Research 20:5839,
GenBank Accession No. X66105. These enzymes are
representative of a family of DNA polymerases, also
including Thermus thermophilus DNA polymerase, which are
thermostable. These enzymes lack a 3'-exonuclease activity
such as that which is effective for editing purposes in DNA
polymerases such as E. coli DNA polymerase I, and phages T7,
T3, and T4 DNA polymerases.

Gelfand et al., U.S. Patent 4,889,818 describe a
wild-type (abbreviation used here: WT), native Thermus
aquaticus DNA polymerase. Gelfand et al., U.S. Patent
5,079,352 describe a recombinant DNA sequence which
encodes a mutein of Thermus aquaticus DNA polymerase from
which the N-terminal 289 amino acids of Thermus aquaticus
DNA polymerase have been deleted (claim 3 of '352,
commercial name Stoffel Fragment, abbreviation used here:
ST), and a recombinant DNA sequence which encodes a
mutein of Thermus aquaticus DNA polymerase from which the
N-terminal 3 amino acids of Thermus aquaticus DNA
polymerase have been deleted (claim 4 of '352, trade name
AmpliTaq, abbreviation used here: AT). Gelfand et al.
report their muteins to be "fully active" in assays for
DNA polymerase, but data as to their maximum
thermostability are not presented.

Amplification of DNA spans by the polymerase chain 
reaction (PCR) has become an important and widespread 
tool of genetic analysis since the introduction of 
thermostable Taq DNA polymerase for its catalysis. 
However, one remaining limitation to prior art methods of 
PCR is the size of the product span that can be 
amplified. For full-length Taq DNA Polymerase and for
N-terminally truncated variants such as Klentaq-278,
Klentaq5 and Stoffel Fragment, PCR amplification
apparently rapidly becomes inefficient or non-existent as
the length of the target span exceeds 5-6 kb. This was
shown even when 30 minutes was used during the extension
step of each cycle.

Although there are several reports of inefficient 
but detectable amplification at 9-10 kb target length and 
one at 15 kb, most general applications are limited to 5 
kb. 

Kainze et al. (Analytical Biochem. 202:46-49 (1992))
report a PCR amplification of over 10 kb: a 10.9 kb and
a 15.6 kb product, utilizing an enzyme of unpublished
biological source (commercially available as "Hot Tub"
DNA polymerase). Kainze et al. report achieving a barely
visible band at 15.6 kb after 30 cycles, starting with 1
ng of λ DNA template per 100 ul of reaction volume. The
efficiency of this amplification was shown to be
relatively low, although a quantitative calculation of
the efficiency was not presented. Attempts by Kainze et
al. to make WT Thermus aquaticus DNA polymerase perform
in the 10-15 kb size range were not successful, nor have
successful results been reported by anyone else for any
form of Thermus aquaticus DNA polymerase in this size
range.

A DNA polymerase formulation capable of efficient 
amplification of DNA spans in excess of 6 kb would 
significantly expand the scope of applications of PCR. 
For instance, whole plasmids, and constructs the size of 
whole plasmids, could be prepared with this method, which 
would be especially valuable in cases in which a portion 
of the DNA in question is toxic or incompatible with 
plasmid replication when introduced into E. coli. If
this thermostable DNA polymerase preparation
simultaneously conferred increased fidelity to the PCR
amplification, the resulting large products would be much
more accurate, active and/or valuable in research and
applications, especially in situations involving
expression of the amplified sequence. If the thermostable
DNA polymerase preparation allowed, in addition, more
highly concentrated yields of pure product, this would
enhance the method of PCR to the point where it could be
used more effectively to replace plasmid replication as a
means to produce desired DNA fragments in quantity.

SUMMARY OF THE INVENTION

Among the several objects of the invention, therefore,
may be noted the provision of a formulation of DNA polymerases
capable of efficiently catalyzing primer extension products of
greater length than permitted by conventional formulations,
including lengths up to at least 35 kilobases, that reduces
the mutagenicity generated by the PCR process, particularly in
comparison with prior art DNA polymerases and for any target
lengths, that maximizes the yield of PCR target fragments and,
concomitantly, enhances the intensity and sharpness of PCR
product bands, without significant sacrifice in flexibility,
specificity, and efficiency; and the provision of an improved
process for amplification by PCR which can be utilized to
reliably synthesize nucleic acid sequences of greater length
and which can effectively utilize PCR products as primers.

Briefly, therefore, the present invention is
directed to a novel formulation of thermostable DNA
polymerases including at least one thermostable DNA polymerase
lacking 3'-5' exonuclease activity and at least one
thermostable DNA polymerase exhibiting 3'-5' exonuclease
activity.

In another aspect, a kit for the synthesis of a
polynucleotide is provided, comprising a first DNA polymerase
which possesses 3'-5' exonuclease activity, and a second DNA
polymerase which lacks 3'-5' exonuclease activity.

In another aspect, a kit for the synthesis of a
polynucleotide is provided, comprising a first DNA polymerase
which possesses 3'-5' exonuclease activity, and a second DNA
polymerase which lacks 3'-5' exonuclease activity, wherein the
first DNA polymerase is selected from the group consisting of
Pyrococcus furiosus DNA polymerase, Thermotoga maritima DNA
polymerase, Thermococcus litoralis DNA polymerase, and
Pyrococcus GB-D DNA polymerase, and the second DNA polymerase
is selected from the group consisting of Thermus aquaticus DNA
polymerase, (exo-) Thermococcus litoralis DNA polymerase,
(exo-) Pyrococcus furiosus DNA polymerase, and (exo-)
Pyrococcus GB-D DNA polymerase.

In a further embodiment of the invention, a method
of amplifying a polynucleotide sequence is provided. The
method includes the steps of mixing a composition with a
synthesis primer, and a synthesis template, with the
composition including a first DNA polymerase possessing 3'-5'
exonuclease activity, and a second DNA polymerase lacking
3'-5' exonuclease activity.

In yet another aspect of the invention, a method of
amplifying a polynucleotide sequence is provided. The method
includes the steps of mixing a composition with a synthesis
primer, and a synthesis template, with the composition
including a first DNA polymerase possessing 3'-5' exonuclease
activity which is selected from the group consisting of
Pyrococcus furiosus DNA polymerase, Thermotoga maritima DNA
polymerase, Thermococcus litoralis DNA polymerase, and
Pyrococcus GB-D DNA polymerase, and a second DNA polymerase
lacking 3'-5' exonuclease activity which is selected from the
group consisting of Thermus aquaticus DNA polymerase, (exo-)
Thermococcus litoralis DNA polymerase, (exo-) Pyrococcus
furiosus DNA polymerase, and (exo-) Pyrococcus GB-D DNA
polymerase.

Other objects and features will be in part apparent 
and in part pointed out hereinafter. 

SUMMARY OF ABBREVIATIONS AND TERMS 

The listed abbreviations and terms, as used herein,
are defined as follows:

Abbreviations:

bp = base pairs 

kb = kilobase; 1000 base pairs 

nt = nucleotides 

BME = beta-mercaptoethanol

PPi = sodium pyrophosphate

In use, the following 3-letter abbreviations often refer
to the single-chain DNA polymerase elaborated by the
microorganism.

Pfu = Pyrococcus furiosus

Pwo = Pyrococcus woesii 

Taq = Thermus aquaticus 

Tfl = Thermus flavus 

Tli = Thermococcus litoralis

Klentaq-nnn = N-terminally deleted Thermus aquaticus 
DNA polymerase that starts with codon nnn+1, although 
that start codon and the next codon may not match the WT 
sequence because of alterations to the DNA sequence to 
produce a convenient restriction site. 

WT = wild-type (full length) or deletion of only 3 aa

aa = amino acid(s)

ST = Stoffel fragment, an N-terminal deletion of Thermus
aquaticus DNA polymerase that could be named Klentaq-288.

-LA = Long and Accurate; an unbalanced mixture of
two DNA polymerases, at least one lacking significant
3'-exonuclease activity and at least one exhibiting
significant 3'-exonuclease activity.

PCR = (noun) 1. The Polymerase Chain Reaction.
2. One such reaction/amplification experiment.
3. (verb) To amplify via the polymerase chain reaction.

ul = microliter(s)

ATCC = American Type Culture Collection

Megaprimer = double-stranded DNA PCR product used
as primer in a subsequent PCR stage of a multi-step
procedure.

Deep Vent = DNA polymerase from Pyrococcus species
GB-D; purified enzyme is available from New England
Biolabs.

Deep Vent exo- = mutant form of Deep Vent DNA
polymerase lacking 3'(editing)-exonuclease.

Vent = DNA polymerase from Thermococcus litoralis;
purified enzyme is available from New England Biolabs.

Vent exo- = mutant form of Vent DNA polymerase
lacking 3'(editing)-exonuclease.

Pfu = DNA polymerase from Pyrococcus furiosus;
purified enzyme is available from Stratagene Cloning
Systems, Inc.

Pfu exo- = mutant form of Pfu DNA polymerase lacking
3'(editing)-exonuclease; purified enzyme is available from
Stratagene Cloning Systems, Inc.

SEQUENASE = A chemically modified or a mutated form
of phage T7 or T3 DNA polymerase wherein the modification
or mutation eliminates the 3'-exonuclease activity.

THESIT = polyethylene glycol monododecyl ether. 

BRIEF DESCRIPTION OF THE DRAWINGS

Figures 1 (A-C) are depictions, respectively, of an
agarose gel on which was loaded a portion of a test PCR
experiment. Figures 1 (A-C) demonstrate the large
increase in efficiency of large DNA span PCR achieved by
variations of a preferred embodiment of the enzyme
formulation of the invention. Although KlenTaq-278 or
Pfu DNA polymerase, alone, are shown to catalyze a low
level of 6.6 kb PCR product formation, various
combinations of the two are seen to be much more
efficient. Lower and lower amounts of Pfu in combination
with Klentaq-278 are seen to be effective, down to the
minimum presented, 1/640. Of those shown, only a
combination of Klentaq-278 and Pfu can catalyze efficient
amplification of 6.6 kb. Per 100 ul, the indicated level
of each enzyme (see Methods, Example 7, for unit
concentrations) was used to catalyze PCR reactions
templated with 19 ng λplac5 DNA and primers MBL and MBR.
20 cycles of 94° 2 min., 60° 2 min., 72° 10 min.

Figure 2 is a depiction of an agarose gel on which
were analyzed the products of PCR experiments to test the
performance of an embodiment of the invention in
catalyzing the amplification of fragments even longer
than 6.6 kb. Figure 2 demonstrates the ability to
amplify 8.4 kb, 12.5 kb, 15 kb, and 18 kb with high
efficiency and large yield, utilizing the 1/640 ratio
embodiment of the enzyme formulation of the invention.
Target product size is indicated above each lane as kb:.
Level of template per 100 ul is indicated as ng λ:. 20
or 30 cycles of PCR were each 2 sec. 94°, 11 min. 70°.
These early amplifications were non-optimal in several
respects compared to the current optimal procedure (see
Methods, Example 7): thick-walled tubes were employed
instead of thin, catalysis was by 1 ul KlentaqLA-64
(63:1::Klentaq-278:Pfu) instead of KlentaqLA-16, the
27mer primers were used (see Table 3) instead of longer
primers, the extension/annealing temperature was 70°
instead of 68°, and the Omnigene thermal cycler was used.
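
The KlentaqLA-n designation used above can be read as a volume ratio of the two stock enzymes; the text gives KlentaqLA-64 as 63:1 Klentaq-278:Pfu. The following is a minimal sketch of that arithmetic, assuming that every KlentaqLA-n follows the same (n-1):1 pattern, which the text states only for n = 64. The function name `la_mix_volumes` is hypothetical and is introduced here purely for illustration.

```python
from fractions import Fraction

def la_mix_volumes(n, total_ul):
    """Split total_ul of a hypothetical KlentaqLA-n mixture into
    (Klentaq-278 ul, Pfu ul), ASSUMING an (n-1):1 volume ratio.
    Only the n = 64 case (63:1) is stated in the text."""
    if n < 2:
        raise ValueError("n must be at least 2")
    total = Fraction(total_ul)
    pfu = total / n            # one part in n is Pfu
    klentaq = total - pfu      # the remaining n-1 parts are Klentaq-278
    return klentaq, pfu

klentaq_ul, pfu_ul = la_mix_volumes(64, 64)
print(klentaq_ul, pfu_ul)      # 63 1
```

Exact fractions are used so that small Pfu volumes such as 1/16 ul are not blurred by floating-point rounding.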

Figure 3 is a depiction of an agarose gel of a PCR
amplification attempted using a 384 bp megaprimer
(double-stranded PCR product) paired with a 43-mer
oligonucleotide primer BtV5. Per 100 ul of reaction
volume, the following enzymes (see Ex. 7, Methods,
for unit concentrations) were used to catalyze
amplifications: lane 1, 1 ul Pfu DNA polymerase; lane 2,
1/16 ul Pfu; lane 3, 1 ul Klentaq-278; lane 4, both
enzymes together (1 ul Klentaq-278 + 1/16 ul Pfu). The
384 bp band near the bottom of the gel is the megaprimer,
which was originally amplified using Klentaq-278. λH3 =
lambda DNA digested with HindIII. The only successful
amplification resulted from the combination of the two
enzymes (lane 4). Vent DNA polymerase could substitute
for Pfu with the same result (data not shown).

Figure 4 is a depiction of an agarose gel
demonstrating that 33mers are better than 27mers. Per
100 ul of reaction volume, 2 ng (lanes 1-6) or 10 ng
(lanes 7-12) of lambda transducing phage template were
amplified using 27mer primers (lanes 1-3, 7-9) or 33mer
primers (lanes 4-6, 10-12). Besides being longer, the
33mer lambda primer sequences were situated 100 bp to the
left of primer MBL and 200 bp to the right of primer MBR
on the lambda genome. KlentaqLA-16 in the amounts of
1.2, 1.4, and 1.6 ul was used to catalyze the
amplifications of 12.5, 15, and 18 kb, respectively. 15
ul aliquots (equivalent to 0.3 or 1.5 ng of λ template)
were analyzed by 0.8% agarose electrophoresis.

Figure 5 is a depiction of an agarose gel showing a
CHEF pulse-field analysis (ref. 11, 4 sec. switching
time) of large PCR products amplified by KlentaqLA-16
(1.2 ul) under conditions which were suboptimal with
respect to pH (unmodified PC2 buffer was used) and
thermal cycler (Omnigene). Starting template (see Table
3) was at 0.1 ng/ul and the time at 68° in each cycle was
21 min. for products over 20 kb, 13 min. for lanes 4 & 5,
and 11 min. for lanes 11-14. The volumes of PCR reaction
product loaded were adjusted to result in approximately
equal intensity; in ul: 12,12,4,2; 10,10,10; 2,2,4,1.
The standard size lanes (S) show full-length λplac5 DNA
(48645 bp) mixed with a HindIII digest of λ DNA. As for
Table 1, the sizes in 5 figures are in base pairs, as
predicted from the primer positions on the sequence of
λplac5 DNA, and sizes with decimal points are in kb, as
determined from this gel.

Figure 6 is a depiction of an agarose gel of 28 kb
and 35 kb products without (lanes 2,3) and with (lanes
5,6) digestion by restriction enzyme HindIII. Before
HindIII digestion, the 28 kb product was amplified with
21 min. extension time per cycle, and the 35 kb product
was cycled with 24 min. extension times, both in the
RoboCycler at optimum pH (see Ex. 7, Methods). Lanes S
(1,4,7) contain markers of undigested λplac5 and
HindIII-digested λplac5 DNA.

Figure 7 is a depiction of an agarose gel showing the
results of a Pfu exo- mutant test. PCR amplification of
8.4 kb by 30 units (0.7 ug) of Klentaq-278 alone (lanes
1,7) and in combination with a very small admixture (1/16
ul or 1/64 ul, equivalent to 1/6 or 1/25 unit) of
archaebacterial Pfu wild type exo+ DNA polymerase (+;
lanes 2,3) or a mutant thereof lacking the 3'-exonuclease
activity (-; lanes 4,5). Lane 6 is the result of 1 ul
(2.5 units) of solely Pfu DNA polymerase (wt, exo+)
being employed.
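
The unit equivalents quoted for Figure 7 follow from the stated Pfu stock strength of 2.5 units per ul. The sketch below reproduces that arithmetic and the resulting Klentaq-278:Pfu unit ratios. The constant and function names are illustrative only, and the unit ratios are derived here rather than quoted from the text.

```python
from fractions import Fraction

PFU_UNITS_PER_UL = Fraction(5, 2)   # from "1 ul (2.5 units)" of Pfu
KLENTAQ_UNITS = 30                  # Klentaq-278 units used per reaction

def pfu_units(volume_ul):
    """Units of Pfu contained in volume_ul of the stock enzyme."""
    return Fraction(volume_ul) * PFU_UNITS_PER_UL

for vol in (Fraction(1, 16), Fraction(1, 64)):
    units = pfu_units(vol)
    ratio = KLENTAQ_UNITS / units
    print(f"{vol} ul Pfu = {float(units):.3f} unit; "
          f"Klentaq:Pfu = {float(ratio):.0f}:1")
# 1/16 ul Pfu = 0.156 unit; Klentaq:Pfu = 192:1
# 1/64 ul Pfu = 0.039 unit; Klentaq:Pfu = 768:1
```

The computed 0.156 and 0.039 unit agree, after rounding, with the "1/6 or 1/25 unit" given in the figure description.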

DESCRIPTION OF THE PREFERRED EMBODIMENT 

DNA polymerases such as those discussed in this
application are commonly composed of up to three
identifiable and separable domains of enzymatic activity,
in the physical order from N-terminal to C-terminal, of
5'-exonuclease, 3'-exonuclease, DNA polymerase. Taq DNA
polymerase has never had a 3'-exonuclease, but certain
mutations of its N-terminal portion lead to a deletion of
its 5'-exonuclease activity. Other DNA polymerases
mentioned, such as Pfu DNA polymerase, do not have the
5'-exonuclease, but their 3'-exonuclease function is
central to the aspect of the invention directed to
mixtures of DNA polymerases E1 (lacking 3'-exonuclease
activity) and E2 (having 3'-exonuclease activity). In
these mixtures, the presence of 5'-exonuclease in either
E1 or E2 has not been shown to be essential to the
primary advantages of the present invention.

Table 1 below depicts the nucleotide sequence of
primers that can be used for amplification of the gene
for a preferred embodiment of the DNA polymerase lacking
3'-exonuclease activity (Klentaq-278) included as the
primary component of the formulation of DNA polymerases
of this invention. The bulk of the DNA sequence for the
gene (between the primers) and the resultant amino acid
sequence of the enzyme, is defined by the indicated
GenBank entry.

Table 1: Primers that amplify the gene for KlenTaq-278

A. The primer at the 5' side of the target fragment.
The start codon is indicated with ***.

   NcoI 10        20        30       KT1 36MER
GAGCCATGGGCCTCCTCCACGAGTTCGGCCTTCTGG  (SEQ ID NO: 1)
     *** |||||||||||||||||||||||||||
      m  g  L  L  H  E  F  G  L  L  E ...  <-- upper case are WT aa
  278   280   282   284   286   288        <-- codon numbering for WT aa
AGTTTGGCAGCCTCCTCCACGAGTTCGGCCTTCTGG...  (SEQ ID NO: 14)
       959       969       979          TaqPol.seq GenBank entry
Accession No. J04639 (numbering includes 5' non-translated
region)

B. The primer at the 3' side of the target fragment. The two
stop codons are indicated with ***. To demonstrate the
homology, the other (complementary) strand of the actual primer
is shown here.

--other strand--                       KLENTAQ32 35mer
                          HindIII
        26        16  ****** 6
GGACTGGCTCTCCGCCAAGGAGTAGTAAGCTTCGC  (SEQ ID NO: 3)
|||||||||||||||||||||||  ||
  D  W  L  S  A  K  E  *
 826   828   830   832
GGACTGGCTCTCCGCCAAGGAGTGATACCACC  (SEQ ID NO: 15)
      2604      2614      2624       TaqPol.seq
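
The reading frames and restriction sites annotated in Table 1 can be checked mechanically. The following is a minimal sketch using the standard genetic code and the two sequences exactly as printed in the table. The helper names `translate` and `reverse_complement` are illustrative and are not part of the original disclosure.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(dna):
    """Translate the complete codons of dna, starting at its first base."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def reverse_complement(dna):
    """Return the opposite strand, written 5' to 3'."""
    return dna.translate(str.maketrans("ACGT", "TGCA"))[::-1]

KT1 = "GAGCCATGGGCCTCCTCCACGAGTTCGGCCTTCTGG"            # SEQ ID NO: 1
KLENTAQ32_OTHER = "GGACTGGCTCTCCGCCAAGGAGTAGTAAGCTTCGC"  # strand shown in part B

print(translate(KT1[KT1.index("ATG"):]))     # MGLLHEFGLL
print(translate(KLENTAQ32_OTHER[1:]))        # DWLSAKE**AS
print(KT1.index("CCATGG") + 1)               # 4  (NcoI site, 1-based)
print(KLENTAQ32_OTHER.index("AAGCTT") + 1)   # 27 (HindIII site, 1-based)
print(reverse_complement(KLENTAQ32_OTHER))   # the primer strand itself, 5' to 3'
```

The first translation reproduces the m g L L H E F G L L residues of part A, and the second shows the two consecutive stop codons that follow D W L S A K E in part B.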

Table 2 below depicts the nucleotide sequence of the same
primers as in Table 1, and shows that these same primers can be
used for amplification of the analogous gene from Thermus
flavus.

Table 2

The same primers as in Table 1 are homologous to Thermus
flavus DNA.

        10        20        30
GAGCCATGGGCCTCCTCCACGAGTTCGGCCTTCTGG  KT1 36MER
     *** |||||||||||||||||||||| ||||
      m  g  L  L  H  E  F  G  L  L  E ...  <-- upper case are WT aa
  278   280   282   284   286   288        <-- codon numbering for WT aa
AGTTTGGAAGCCTCCTCCACGAGTTCGGCCTCCTGG  Tfl.seq GenBank entry
      1387      1397      1407       Accession number X66105
(numbering includes 5' non-translated region)

        26        16         6       --other strand--
GGACTGGCTCTCCGCCAAGGAGTAGTAAGCTTCGC  KLENTAQ32 35mer
|||||||||||||||||||||||||   | | |
  D  W  L  S  A  K  E  *
 826   828   830
GGACTGGCTCTCCGCCAAGGAGTAGGGGGGTCCTG  Tfl.seq
      3032      3042      3052
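
The match tracks drawn between primer and template in Tables 1 and 2 amount to a base-by-base comparison. A minimal sketch of that comparison for the 5' primer KT1 against the Thermus aquaticus and Thermus flavus template segments printed in the tables is given below. The function name `match_track` is hypothetical and is introduced only for illustration.

```python
def match_track(primer, template, match="|", mismatch=" "):
    """Mark each position where primer and template carry the same base."""
    return "".join(match if p == t else mismatch
                   for p, t in zip(primer, template))

KT1   = "GAGCCATGGGCCTCCTCCACGAGTTCGGCCTTCTGG"   # primer, Tables 1 and 2
TAQ_5 = "AGTTTGGCAGCCTCCTCCACGAGTTCGGCCTTCTGG"   # Thermus aquaticus, Table 1
TFL_5 = "AGTTTGGAAGCCTCCTCCACGAGTTCGGCCTCCTGG"   # Thermus flavus, Table 2

for name, template in (("Taq", TAQ_5), ("Tfl", TFL_5)):
    track = match_track(KT1, template)
    print(name, track.count("|"), "of", len(KT1), "bases match")
# Taq 27 of 36 bases match
# Tfl 26 of 36 bases match
```

The single additional mismatch against Thermus flavus falls at primer position 32, a silent change (CTT to CTC, both leucine), so the encoded residues shown in Table 2 are unchanged.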



Referring now to Table 1, the primers and logic for
amplification by PCR of the recombinant DNA sequence
encoding a preferred embodiment of the thermostable DNA
polymerase of the invention lacking 3'-exonuclease
activity (referred to herein as Klentaq-278), are set
forth. As depicted in Table 1, an initiator methionine
and a glycine residue occupy the first two N-terminal
positions of Klentaq-278, previously occupied by residues
279 and 280 of WT Thermus aquaticus DNA polymerase,
followed by the amino acid sequence of wild-type Thermus
aquaticus DNA polymerase, beginning with the amino acid
residue at position 281 as described by Lawyer et al.
The codons encoding amino acid residues 1 through 280 of
Thermus aquaticus DNA polymerase are therefore deleted,
and the amino acids 1 through 280 are not present in the
resulting gene product.

The primers and logic for amplification of another
preferred embodiment of the DNA polymerase of the
invention lacking 3'-exonuclease activity are set forth
in Table 2. In this embodiment, the same deletion
mutation described above is made to the highly analogous
enzyme Thermus flavus DNA polymerase.

The mutant DNA polymerase Klentaq-278 exhibits
thermostability at temperatures above those reported for
previous variants of Thermus aquaticus DNA polymerase and
has demonstrated a fidelity in final PCR products which
is greater than that of WT Thermus aquaticus DNA
polymerase, when both are utilized at the 72°C
temperatures recommended for DNA synthesis.

A vector is also provided which includes a
recombinant DNA sequence encoding a DNA polymerase
comprising the amino acid sequence of Thermus aquaticus
or Thermus flavus DNA polymerase, except that it adds a
methionine and glycine residue at the N-terminus and
excludes the N-terminal 280 amino acids of wild-type
Thermus aquaticus DNA polymerase (see Lawyer et al.,
supra).

In preferred embodiments, the vector is that nucleic
acid present as plasmid pWB254b (SEQ ID NO: 5) deposited
as ATCC No. 69244 or a host cell containing such a
vector.

In a related aspect, the invention features purified 
DNA polymerases of the type described herein. As used 
in this application, "purified" means that the polymerase 
of the invention is isolated from a majority of host cell 
proteins normally associated with it. Preferably, the 
polymerase is at least 10% (w/w) of the protein of a 
preparation. Even more preferably, it is provided as a 
homogeneous preparation, e.g., a homogeneous solution. 

In general, the recombinant DNA sequence encoding
for a preferred DNA polymerase lacking 3'-exonuclease
activity which serves as the primary component of the DNA
polymerase formulation of the present invention is
amplified from a Thermus aquaticus genomic DNA or from a
clone of the portion of the Thermus aquaticus DNA
polymerase gene which is larger than the desired span,
using the polymerase chain reaction (PCR, Saiki et al.,
Science 239:487, 1988), employing primers such as those
in Table 1 into which appropriate restriction sites have
been incorporated for subsequent digestion.

The recombinant DNA sequence described above is then
cloned into an expression vector using procedures well
known to those in this art. Specific nucleotide
sequences in the vector are cleaved by site-specific
restriction enzymes such as NcoI and HindIII. Then,
after optional alkaline phosphatase treatment of the
vector, the vector and target fragment are ligated
together with the resulting insertion of the target
codons in place adjacent to desired control and
expression sequences. The particular vector employed
will depend in part on the type of host cell chosen for
use in gene expression. Typically, a host-compatible
plasmid will be used containing genes for markers such as
ampicillin or tetracycline resistance, and also
containing suitable promoter and terminator sequences.

In a preferred procedure, the recombinant DNA
expression sequence described above is cloned into
plasmid pWB253 (expresses KlenTaq-235 deposited as ATCC
No. 68431) or pWB250 (expresses luciferase/NPTII fusion),
the backbone of which is pTAC2 (J. Majors, Washington
University), a pBR322 derivative. The specific sequence
of the resulting plasmid, designated pWB254b, is SEQ ID
NO: 5.

Bacteria, e.g., various strains of E. coli, and
yeast, e.g., Baker's yeast, are most frequently used as
host cells for expression of DNA polymerase, although
techniques for using more complex cells are known. See,
e.g., procedures for using plant cells described by
Depicker, A., et al., J. Mol. Appl. Gen. (1982) 1:561.
E. coli host strain X7029, wild-type F-, having deletion
X74 covering the lac operon is utilized in a preferred
embodiment of the present invention.

A host cell is transformed using a protocol designed
specifically for the particular host cell. For E. coli,
a calcium treatment, Cohen, S.N., Proc. Natl. Acad. Sci.
69:2110 (1972), produces the transformation.
Alternatively and more efficiently, electroporation of
salt-free E. coli is performed after the method of Dower
et al. (1988), Nucleic Acids Research 16:6127-6145.
After transformation, the transformed hosts are selected
from other bacteria based on characteristics acquired
from the expression vector, such as ampicillin
resistance, and then the transformed colonies of bacteria
are further screened for the ability to give rise to high
levels of isopropylthiogalactoside (IPTG)-induced
thermostable DNA polymerase activity. Colonies of
transformed E. coli are then grown in large quantity and
expression of Klentaq-278 DNA polymerase is induced for
isolation and purification.

Although a variety of purification techniques are
known, all involve the steps of disruption of the E. coli
cells, inactivation and removal of native proteins and
precipitation of nucleic acids. The DNA polymerase is
separated by taking advantage of such characteristics as
its weight (centrifugation), size (dialysis,
gel-filtration chromatography), or charge (ion-exchange
chromatography). Generally, combinations of these
techniques are employed together in the purification
process. In a preferred process for purifying
Klentaq-278 the E. coli cells are weakened using lysozyme
and the cells are lysed and nearly all native proteins
are denatured by heating the cell suspension rapidly to
80°C and incubating at 80-81°C for 20 minutes. The
suspension is then cooled and centrifuged to precipitate
the denatured proteins. The supernatant (containing
Klentaq-278) then undergoes a high-salt
polyethylene-imine treatment to precipitate nucleic
acids. Centrifugation of the extract removes the nucleic
acids. Chromatography, preferably on a heparin-agarose
column, results in nearly pure enzyme. More detail of
the isolation is set forth below in Example 3.

The novel DNA polymerase formulation of the present 
invention may be used in any process for which such an 
enzyme formulation may be advantageously employed. This 
enzyme formulation is particularly useful for PCR 
amplification techniques, but may also be used for other 
processes employing DNA polymerases such as nucleic acid 
sequencing, cycle sequencing, DNA restriction digest 
labelling and blunting, DNA labelling, in vivo DNA
footprinting, and primer-directed mutagenesis.

Amplification

Polymerase chain reaction (PCR) is a method for
rapidly amplifying specific segments of DNA, in geometric
progression, up to a million fold or more. See, e.g.,
Mullis U.S. Patent No. 4,683,202, which is incorporated
herein by reference. The technique relies on repeated
cycles of DNA polymerase-catalyzed extension from a pair
of primers with homology to the 5' end and to the
complement of the 3' end of the DNA segment to be
amplified. A key step in the process is the heat
denaturing of the DNA primer extension products from
their templates to permit another round of amplification.
The operable temperature range for the denaturing step
generally ranges from about 93°C to about 95°C, which
irreversibly denatures most DNA polymerases,
necessitating the addition of more polymerase after each
denaturation cycle. However, no additional DNA
polymerase needs to be added if thermostable DNA
polymerases such as Thermus aquaticus DNA polymerase are
used, since they are able to retain their activity at
temperatures which denature double-stranded nucleic
acids. As described in Example 4, below, Klentaq-278 has
demonstrated the ability to survive meaningful repeated
exposure to temperatures of 99°C, higher than for any
previously known DNA polymerase.
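
The "geometric progression, up to a million fold" noted above can be illustrated with an idealized model in which each cycle multiplies the product by (1 + efficiency). This is a simplified sketch only; the per-cycle `efficiency` parameter is an assumption introduced for illustration and is not a quantity reported in this application.

```python
def fold_amplification(cycles, efficiency=1.0):
    """Idealized PCR yield after the given number of cycles.

    efficiency is the ASSUMED fraction of templates copied per cycle
    (1.0 means perfect doubling); real reactions fall below this."""
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must lie between 0 and 1")
    return (1.0 + efficiency) ** cycles

print(fold_amplification(20))   # 1048576.0
```

Under perfect doubling, 20 cycles already give roughly a million-fold amplification, which is the order of magnitude cited in the text.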

Deposit 

Strain pWB254b/X7029 was deposited with the American
Type Culture Collection, Maryland, on February 18, 1993
and assigned the number ATCC 69244. Applicant
acknowledges his responsibility to replace this culture
should it die before the end of the term of a patent
issued hereon, 5 years after the last request for a
culture, or 30 years, whichever is the longer, and his
responsibility to notify the depository of the issuance
of such a patent, at which time the deposits will be made
available to the public. Until that time the deposits
will be made available to the Commissioner of Patents
under the terms of 37 C.F.R. Section 1.14 and 35 U.S.C.
§112.

In the principal aspect of the invention, a target
length limitation to PCR amplification of DNA has been
identified and addressed. Concomitantly, the base pair
fidelity, the ability to use PCR products as primers, and
the maximum yield of target fragment were increased.
These improvements were achieved by the combination of a
DNA polymerase lacking significant 3'-exonuclease
activity, preferably, Klentaq-278 described above, with a
low level of a DNA polymerase exhibiting significant
3'-exonuclease activity (for example, Pfu, Vent, or Deep
Vent). Surprisingly, target fragments of at least 35 kb
can be amplified to high yields from, for example, 1 ng
lambda DNA template with this system.

Moreover, products in the range 6.6 to 8.4 kb can be
efficiently amplified by a formulation of thermostable
DNA polymerases consisting of a majority component
comprised of at least one thermostable DNA polymerase
lacking 3'-exonuclease activity and a minority component
comprised of at least one thermostable DNA polymerase
exhibiting 3'-exonuclease activity, i.e., wherein the
ratio of DNA polymerase lacking 3'-exonuclease activity
to that exhibiting 3'-exonuclease activity exceeds 1 to
1, measured by DNA polymerase activity units (or by
weight where the DNA polymerase activity of the
3'-exonuclease activity-exhibiting enzyme has been
eliminated, as described below).

The prior art technology only allowed relatively 
inefficient and sporadic amplification of fragments in 
this size range, resulting in only relatively faint 
product bands or no detectable product at all. In light 
of the current discovery, I believe I understand the 
reason for the inefficiency of the prior art. Without 
limiting myself to any particular theory, it is believed 
that Thermus aquaticus DNA polymerase and its variants 
are slow to extend a mismatched base pair (which they 
cannot remove since they lack any 3'-exonuclease). A 
couple of companies (New England Biolabs and Stratagene) 
have introduced thermostable enzymes which exhibit a 
3'-(editing) exonuclease which should, one would think, 
allow the removal of mismatched bases to result in both 
efficient extension and more accurately copied products. 
In practice, these two enzymes (Vent and Pfu DNA 
polymerase) are unreliable and much less efficient than 
expected. One possible explanation for the unreliability 
of these enzymes for PCR is that the 3'-exonuclease often 
apparently attacks and partially degrades the primers so 
that little or no PCR is possible. This primer attack 
problem is worse for some primers than others. It has 
been reported (Anonymous, The NEB Transcript, New England 
Biolabs, (March, 1991) p. 4.) that the Vent DNA 
polymerase leaves the 5' 15 nt intact, so that if the 
annealing conditions allow that 15 nt to prime, PCR could 
presumably proceed. This would of course only allow 
annealing at lower, non-selective temperatures, and the 
5' 15 nt of the primers must be exactly homologous to the 
template. 

I have discovered that the beneficial effects of a 
3'-exonuclease can be obtained with an unexpectedly 
minute presence of one or more DNA polymerases which 
exhibit 3'-exonuclease activity (herein called "E2") such 
as certain Archaebacterial DNA polymerases, whilst 
efficient extension is being catalyzed by a large amount 
of one or more DNA polymerases which lack 3'-exonuclease 
activity, such as Klentaq-278 or AT (herein called "E1"). 
As a minority component of a formulation or mixture of 
DNA polymerases, the unreliability and inefficiency of 
the 3'-exonuclease DNA polymerase, discussed above, is 
substantially reduced or eliminated. Moreover, since it 
is believed that the 3'-exonuclease is removing 
mismatches to eliminate pausing at the mismatches, the 
resulting DNA exhibits fewer base pair changes, which is 
a valuable decrease in the mutagenicity of PCR without 
sacrificing flexibility, specificity, and efficiency. In 
fact, the combination, even for KlenTaq-278/Pfu units 
ratios as high as 2000, exhibited greatly increased 
efficiency of amplification. For most applications, the 
mixture of DNA polymerases must be at a relative DNA 
polymerase unit ratio of E1 to E2 of at least about 4:1, 
before enhanced product length and yield can be achieved. 
When Pfu DNA polymerase was used in the formulation, the 
ratio preferably is in the range 80 to 1000 parts 
KlenTaq-278 per part (unit) Pfu, more preferably from 
about 150 to about 170:1, and most preferably, is about 
160:1, depending somewhat on the primer-template 
combination. Similar ratios are preferred for mixtures 
of Pfu and Klentaq-291. 

If Deep Vent is substituted for Pfu for use in 
combination with Klentaq-278 or -291, the most preferred 
ratio for most applications increases to from about 450 
to about 500:1 E1 to E2; if full-length (WT) Taq or 
Amplitaq is included as E1, the most preferred ratio to 
Pfu or other E2 component is between about 10 and about 
15:1 of E1 to E2. 

E2 of the invention includes, but is not limited to, 
DNA polymerase encoded by genes from Pfu, Vent, Deep 
Vent, T7 coliphage, Tma, or a combination thereof. E1 of 
the invention includes, but is not limited to, a mutant, 
3'-exonuclease negative form of an E2 DNA polymerase, or 
alternatively, a DNA polymerase which, in unmutated form, 
does not exhibit significant 3'-exonuclease activity, 
such as the DNA polymerases encoded by genes from Taq, 
Tfl, or Tth, or a combination thereof. 

As discussed below, the formulation of DNA 
polymerases of the present invention also includes 
formulations of DNA polymerase wherein E1 comprises a 
reverse transcriptase such as SEQUENASE. 

Additional examples of the formulations of the 
present invention include mixtures wherein E1 comprises 
or consists of a mutant or chemical modification of T7 or 
T3 DNA polymerase and E2 comprises or consists of a 
wild-type T7 or T3 DNA polymerase, or, in another variation, 
E1 comprises or consists of a Vent DNA polymerase lacking 
3'-exonuclease activity (sold by New England Biolabs as 
Vent exo-) and E2 comprises or consists of Vent. 

The principle here discovered, namely the use of low 
levels of 3'-exonuclease during primer extension by a DNA 
polymerase lacking 3'-exonuclease, is preferably employed 
using thermostable DNA polymerases, but is applicable to 
general DNA polymerase primer extensions, including 
normal temperature incubations (i.e. using 
non-thermostable DNA polymerases) and including reverse 
transcriptase enzymes, which are known to lack a 
3'-(editing) exonuclease (Battula & Loeb, 1976). An example 
of the former is the use of SEQUENASE (exo-) as the 
majority enzyme, and wild-type T7 DNA polymerase (exo+) 
or Klenow fragment as the minority component. An example 
of the latter is AMV (Avian Myeloblastosis Virus) or MLV 
(Murine Leukemia Virus) Reverse Transcriptase as the 
major component, and Klenow fragment, T7 DNA polymerase, 
or a thermostable DNA polymerase such as Pfu or Deep Vent 
as the minor component. Because of the lower activity of 
thermostable DNA polymerases at the temperatures of 37 
degrees and 42 degrees used by these reverse 
transcriptases, higher levels are likely to be required 
than are used in PCR. Although Klenow fragment DNA 
polymerase is not a preferred DNA polymerase using RNA as 
a template, it does function to recognize this template 
(Karkas, 1973; Gulati, Kacian & Spiegelman, 1974), 
particularly in the presence of added Mn ion. Added Mn 
ion is routinely used to achieve reverse transcription by 
thermostable DNA polymerase Tth, unfortunately (in the 
prior art) without the benefit of an exo+ component. It 
must be stressed that for the use of the exo+ component 
for reverse transcriptase reactions, extra care must be 
taken to ensure that the exo+ component is entirely free 
of contaminating RNAse. 

The following references describe methods known in 
the art for using reverse transcriptases, and are hereby 
incorporated by reference. 

Battula N, Loeb LA. On the fidelity of DNA 
replication. Lack of exodeoxyribonuclease activity and 
error-correcting function in avian myeloblastosis virus 
DNA polymerase. Journal of Biological Chemistry. 
251(4):1982-6, 1976 Feb 25. 

Gulati SC, Kacian DL, Spiegelman S. Conditions 
for using DNA polymerase I as an RNA-dependent DNA 
polymerase. Proceedings of the National Academy of 
Sciences of the United States of America. 71(4):1035-9, 
1974 Apr. 

Karkas JD. Reverse transcription by Escherichia 
coli DNA polymerase I. Proceedings of the National 
Academy of Sciences of the United States of America. 
70(12):3834-8, 1973 Dec. 

DNA Polymerase with no polymerase activity, only 
3'-exonuclease activity: 

While not limiting myself to a particular theory, 
applicant believes that the enzymatic activity of value 
in the minor (E2) component is the 3'-exonuclease 
activity, not the DNA polymerase activity. In fact, it 
is further believed that this DNA polymerase activity is 
potentially troublesome, leading to unwanted synthesis or 
less accurate synthesis under conditions optimized for 
the majority (E1) DNA polymerase component, not the 
minority one. As taught by Bernad, Blanco and Salas 
[(1990) Site-directed mutagenesis of the YCDTDS amino acid 
motif of the phi 29 DNA polymerase, Gene 94:45-51], who 
mutated the conserved "Region I" DNA polymerase motif 
of phi 29 DNA polymerase, either Region III or Region I 
of the Pfu DNA polymerase gene is mutated; this gene has 
been sequenced by Uemori, T., Ishino, Y., Toh, H., Asada, F. 
and Kato, I., Organization and nucleotide sequence of the 
DNA polymerase gene from the archaeon Pyrococcus 
furiosus, Nucleic Acids Res. 21, 259-265 (1993). 

The following examples illustrate the invention. 

EXAMPLE 1 

Construction of an Expressible Gene for Klentaq-278 

In order to construct the Klentaq-278 DNA polymerase 
gene having a recombinant DNA sequence as described 
above, the following procedure was followed. 

The mutated gene was amplified from 0.25 ug of total 
Thermus aquaticus DNA using the polymerase chain reaction 
(PCR, Saiki et al., Science 239:487, 1988) primed by the 
two synthetic DNA primers of Table 1. Primer KT1, SEQ ID 
NO:1, has homology to the wild-type DNA starting at codon 
280; this primer is designed to incorporate an NcoI site 
into the product amplified DNA. Primer Klentaq32, SEQ ID 
NO:3, is a 33mer spanning the stop codon on the other strand 
of the wild-type gene encoding Thermus aquaticus DNA 
polymerase, and incorporates a HindIII site and a double 
stop codon into the product DNA. 

The buffer for the PCR reaction was 20 mM Tris HCl 
pH 8.55, 2.5 mM MgCl2, 16 mM (NH4)2SO4, 150 ug/ml BSA, and 
200 uM each dNTP. The cycle parameters were 2' 95°, 2' 
65°, 5' 72°. 

In order to minimize the mutations introduced by PCR 
(Saiki et al., supra), only 16 cycles of PCR were 
performed before phenol extraction, ethanol 
precipitation, and digestion with the restriction enzymes 
NcoI and HindIII. 

EXAMPLE 2 

Preparation of an Expression Vector 

The product NcoI and HindIII fragment was cloned 
into plasmid pWB254b which had been digested with NcoI, 
HindIII, and calf intestine alkaline phosphatase. The 
backbone of this plasmid, previously designated pTAC2 and 
obtained from J. Majors, carries the following elements 
in counter-clockwise direction from the PvuII site of 
pBR322 (an apostrophe ' designates that the direction of 
expression is clockwise instead of counter-clockwise): a 
partial lacZ' sequence, lacI', lacPUV5 (orientation not 
known), two copies of the tac promoter from PL 
Biochemicals (Pharmacia-LKB; catalog no. 27-4883), the T7 
gene 10 promoter and start codon modified to consist of an 
NcoI site, a HindIII site, the trpA terminator (PL no. 
27-4884-01), an M13 origin of replication, and the AmpR 
gene of pBR322. Expression of the cloned gene is 
expected to be induced by 0.1 mM IPTG. 

Ampicillin-resistant colonies arising from the cloning 
were assayed by the single colony thermostable DNA 
polymerase assay of Sagner et al. (1991) [Gene 97:119-23] 
and 4 strong positives were sized by the toothpick assay 
(Barnes, Science 195:393, 1977). One of these, number 
254.7, was of the expected size except for a small 
proportion of double insert. This plasmid was further 
purified by electroporation into E. coli X7029 and screened 
for size by the toothpick assay, and one plasmid of the 
expected size with no double insert contamination was 
designated pWB254b. This plasmid was used for the 
production of Klentaq-278 described herein. 

EXAMPLE 3 

Purification of Large Amounts of Klentaq-278 

Plasmid pWB254 has a double (tandem repeat) tac 
promoter and the T7 gene 10 leader sequence, an ATG start 
codon, a glycine codon and then codons 280-832 of Thermus 
aquaticus DNA polymerase, then a tandem pair of stop 
codons followed by the trp transcription terminator. The 
pBR322-based plasmid vector (pTac2 from John Majors) is 
ampicillin resistant. The cells are grown on very rich 
medium (see below). Bacterial host X7029 is wild-type F- 
E. coli except for deletion X74 of the lac operon. 

Medium: Per liter water, 100 mg ticarcillin (added 
when cool), 10 g Y.E., 25 g Tryptone, 10 g glucose, 
1X M9 salts with no NaCl (42 mM Na2HPO4, 22 mM KH2PO4, 19 mM 
NH4Cl). Do not autoclave the glucose and the 10X M9 
together; instead, autoclave one of them separately and mix 
in later. Adjust pH to 8 with 5 M NaOH (about 1 ml). 
Add IPTG to 0.1 mM at OD550 = 1 or 2, and shake well at 
30° C. From OD = 2 up to 8 or 10, every half hour or so 
do the following: 

1. Read the pH with pH sticks 5-10. Adjust to pH 
8.5 with 5 M NaOH and swirling (2 to 5 ml per liter) 
whenever the pH falls below 8. 

2. Read and record the OD550, usually as a 1/10 or 
1/50 dilution. 

3. This addition of glucose is optional and not 
necessarily of any value (evaluation of this question is 
incomplete at this time.) Read the glucose level with 
glucose sticks, and add an additional 0.5% (10 ml of 50%) 
if the level falls below 0.2%. 

If it is late, the cells can shake at 30° C all 
night after the last pH adjustment. Alternatively, set 
them in the cold room if they have not grown much in a 
few hours. 

Concentrate the cells e.g. by centrifugation in a 
GS3 rotor for 8 minutes at 8 krpm. Pour off the 
supernatant and add culture to spin more down onto the 
same pellets. 

Lysis: 

Resuspend the cells in TMN buffer (50 mM Tris-HCl 
pH 8.55, 10 mM MgCl2, 16 mM (NH4)2SO4), using a volume in 
milliliters equal to twice the packed cell weight in grams. 

To each 300 ml of cell suspension add 60 mg lysozyme 
and incubate the cells at 5-10° C. with occasional swirling 
for 15 minutes. Then add NP40 or Triton X100 to 
0.1%, and Tween 20 to 0.1%, by adding 1/100 volume of a 
solution of 10% in each. Then heat the cell suspension 
rapidly to 80° C. by swirling it in a boiling water bath, 
then maintain the cells (fast becoming an extract) at 
80-81° C. for 20 minutes. Use a clean thermometer in the 
cells to measure temperature. Be sure the flask and bath 
are covered, so that even the lip of the flask gets the 
full heat treatment. After this treatment, which is 
expected to have inactivated all but a handful of enzymes, 
cool the extract to 37° C. or lower in an ice bath 
and add 2 ml of protease inhibitor (100 mM PMSF in 
isopropanol). From this point forward, try not to contact 
the preparation with any flask, stir bar, or other 
object or solution that has not been autoclaved. (Detergents 
and BME are not autoclavable. The PEI and ammonium 
sulfate are also not autoclaved.) The purpose of the 
autoclaving is not only to avoid microbial contamination, 
but also to avoid contamination with DNA or nucleases. 
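
By way of illustration only, the scaling of the lysis additions described above can be sketched in Python. The function names are hypothetical and form no part of the procedure; the sketch assumes that the quoted amounts (60 mg lysozyme per 300 ml of suspension, and 1/100 volume of the 10% detergent stock) scale linearly with the volume of the cell suspension.

```python
def tmn_buffer_ml(packed_cells_g: float) -> float:
    """TMN buffer volume (ml) equal to twice the packed cell weight (g)."""
    return 2.0 * packed_cells_g


def lysis_additions(suspension_ml: float) -> dict:
    """Scale the amounts quoted per 300 ml of cell suspension (assumed linear)."""
    return {
        # 60 mg lysozyme per 300 ml of suspension
        "lysozyme_mg": 60.0 * suspension_ml / 300.0,
        # 1/100 volume of a stock that is 10% in each detergent (about 0.1% final)
        "detergent_stock_ml": suspension_ml / 100.0,
    }


if __name__ == "__main__":
    print(tmn_buffer_ml(100.0))    # buffer volume for 100 g of packed cells
    print(lysis_additions(300.0))  # additions for 300 ml of suspension
```

The PMSF addition is left out of the sketch because the text gives it as a fixed 2 ml without stating the volume to which it refers.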

Distribute into centrifuge bottles and centrifuge at 
2° C. (for instance, 30 minutes at 15 krpm in a Sorvall 
SS-34 rotor or 14 h at 4 krpm in a GS3 rotor). The 
supernatant is designated fraction I, and can be assayed 
for DNA polymerase activity. 

High-salt PEI precipitation 

After rendering fraction I 0.25 M in NaCl (add 14.6 
g per liter), add five percent Polymin-P (PEI, 
polyethyleneimine, Sigma) dropwise with stirring on ice 
to precipitate nucleic acids. To determine that adequate 
Polymin-P has been added, and to avoid addition of more 
than the minimum amount necessary, test 1/2 ml of 
centrifuged extract by adding a drop of Polymin-P, and only if 
more precipitate forms, add more Polymin-P to the bulk 
extract, mix and retest. Put the test aliquots of 
extract back into the bulk without contaminating it. 
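
The figure of 14.6 g of NaCl per liter follows from the molar mass of NaCl (58.44 g/mol). A minimal sketch of the arithmetic is given below; the function name is hypothetical, and the sketch neglects the salt already present in fraction I and the small volume change on dissolving.

```python
NACL_G_PER_MOL = 58.44  # molar mass of NaCl


def nacl_grams(volume_l: float, molarity: float = 0.25) -> float:
    """Grams of NaCl needed to bring volume_l liters to the given molarity."""
    return volume_l * molarity * NACL_G_PER_MOL


if __name__ == "__main__":
    print(round(nacl_grams(1.0), 1))  # 14.6 g per liter, as stated in the text
```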

To confirm that enough PEI has been added, centrifuge 
3 ml and aliquot the supernatant into 1/2 ml 
aliquots. Add 0, 2, 4, 6 or 10 ul of 5% PEI. Shake, let 
sit on ice, and centrifuge in the cold. Load 15 ul of 
these aliquot supernatants onto an agarose gel containing 
ethidium bromide and electrophorese until the blue dye 
has travelled 2 cm. Inspect the gel on a UV light box 
for detectable DNA or RNA in the supernatant. For the 
bulk extract, use about 1/100 volume (i.e. 2-3 ml for a 
300 ml extract) excess 5% PEI over the minimum necessary 
to remove all DNA by the agarose gel test. 

Stir in the cold for at least 15 minutes. 
Centrifugation of the extract then removes most of the 
nucleic acids. Keep the supernatant, avoiding any trace 
of the pellet. 

Dilute the PEI supernatant with KTA buffer until the 
conductivity is reduced to at or below the conductivity 
of KTA buffer with added 22 mM ammonium sulfate. (Check 
conductivity of a 1/40 dilution compared to a similar 
dilution of genuine 22 mM A.S. in KTA.) Usually this is 
about a 5-fold dilution. 

Chromatography with Bio-Rex 70 (used by Joyce & 
Grindley) (Joyce, C.M. & Grindley, N.D.E. (1983) 
Construction of a plasmid that overproduces the large 
proteolytic fragment (Klenow fragment) of DNA polymerase 
I of E. coli, Proc. Natl. Acad. Sci. U.S.A. 80, 
1830-1834) is unsuccessful (no binding), but unavoidable, 
since without it, the next column (heparin agarose) will 
not work efficiently. I believe that the important 
function of the Bio-Rex 70 step is to remove all excess PEI, 
although it is possible that some protein is removed as 
well. CM-cellulose does not substitute for Bio-Rex 70. 

Pass the diluted PEI supernatant through equilibrated 
Bio-Rex 70 (10 ml per 100 g cells). The polymerase 
activity flows through. Rinse the column with 2 
column volumes of 22 mM A.S. / KTA. Our procedure is to 
set up the following heparin agarose column so that the 
effluent from the Bio-Rex 70 column flows directly onto 
it. 

Heparin Agarose Chromatography (room temperature, 
but put fractions on ice as they come off.) 

Load the Bio-Rex flow-through slowly onto heparin 
agarose (Sigma; 10 ml per 100 grams of cells [this could 
be too little heparin agarose].) Wash with several column 
volumes of KTA + 22 mM A.S., then three column volumes 
of KTA + 63% glycerol + 11 mM A.S., then elute the 
pure enzyme with KTA + 63% glycerol + 222 mM A.S. + 0.5% 
THESIT (this is more THESIT for the final eluate.) 

Pool the peak of polymerase activity or OD280 (starts 
about at 2/3 of one column volume after 222 mM starts, 
and is about 2 column volumes wide). Store pool at -20° C. 
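
As a rough planning aid, the column size and the expected position of the elution pool can be estimated from the cell mass. The sketch below is illustrative only: the function name is hypothetical, and it assumes the quoted 10 ml of heparin agarose per 100 g of cells (which the text itself notes may be too little) together with the stated peak position of about 2/3 column volume after the 222 mM step and a width of about 2 column volumes.

```python
def heparin_column_plan(cells_g: float) -> dict:
    """Estimate bed volume and elution-pool window (ml after the 222 mM step)."""
    bed_ml = 10.0 * cells_g / 100.0          # 10 ml resin per 100 g of cells
    pool_start_ml = bed_ml * 2.0 / 3.0       # peak starts about 2/3 column volume in
    pool_end_ml = pool_start_ml + 2.0 * bed_ml  # peak is about 2 column volumes wide
    return {
        "bed_ml": bed_ml,
        "pool_start_ml": pool_start_ml,
        "pool_end_ml": pool_end_ml,
    }


if __name__ == "__main__":
    print(heparin_column_plan(100.0))
```

For 100 g of cells this gives a 10 ml bed and a pool of roughly 20 ml. The actual pool should still be chosen from the measured polymerase activity or OD280.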

The storage buffer is a hybrid of, and a slight 
variation of, AmpliTaq storage buffer as recommended by 
Perkin-Elmer Cetus and Taq storage buffer used by 
Boehringer-Mannheim: 50% glycerol (v/v; 63% w/v), 222 mM 
ammonium sulfate (diluted to about 50 mM for bench-strength 
samples), 20 mM Tris-HCl pH 8.55, 0.1 mM EDTA, 10 
mM mercaptoethanol, 0.5% THESIT. 

The THESIT causes some thickening and cloudiness 
below -10° C. This seems to cause no harm, but we suggest 
you warm the enzyme to 0° C. on ice before 
aliquoting for use. THESIT replaces the combination of 
0.5% Triton-X100, 0.5% Tween 20, which you may want to 
consider as an alternative. 

I have had sporadic reports that freezing can 
inactivate the enzyme. Exercise caution in this 
regard. This question is under current investigation. 
Storage at -80° (after quick-cooling with 
liquid nitrogen) is being tested and looks promising, 
but more than one freeze-thaw cycle has been 
deleterious to the enzyme preparation on some occasions. 

Our final yield of enzyme from 7 liters (100 g 
cells) was once 28 ml at a concentration of 120,000 
units per ml (4 x bench-strength). 

1/4 ul of bench-strength enzyme will support 
the PCR of a 2 kb span of DNA in a 100 ul reaction. 
Template is 5-10 ng of plasmid DNA. Each cycle 
consists of 1 min 98° C, 1 min 65° C, 6 min 72° C. 
Cycle number is 16-20. Less enzyme is needed for 
smaller-sized products (1/8 ul for 500 bp) and more 
enzyme is needed for larger products (1 ul for 5 
kb). 
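
The yield figures above can be put in terms of bench-strength enzyme and 100 ul reactions. The following sketch is illustrative only; the function and parameter names are hypothetical, and it reads "4 x bench-strength" as meaning that the pool is four times as concentrated as bench-strength enzyme.

```python
def yield_summary(pool_ml: float = 28.0,
                  units_per_ml: float = 120_000.0,
                  fold_over_bench: float = 4.0,
                  ul_per_reaction: float = 0.25) -> dict:
    """Convert a concentrated enzyme pool into bench-strength terms."""
    total_units = pool_ml * units_per_ml
    bench_units_per_ml = units_per_ml / fold_over_bench
    bench_volume_ml = pool_ml * fold_over_bench
    # number of 100 ul reactions supported at 1/4 ul of bench-strength enzyme each
    reactions = bench_volume_ml * 1000.0 / ul_per_reaction
    return {
        "total_units": total_units,
        "bench_units_per_ml": bench_units_per_ml,
        "bench_volume_ml": bench_volume_ml,
        "reactions_2kb": reactions,
    }


if __name__ == "__main__":
    for name, value in yield_summary().items():
        print(name, value)
```

Under these assumptions the 28 ml pool contains 3,360,000 units, corresponds to 112 ml of bench-strength enzyme at 30,000 units per ml, and would support about 448,000 amplifications of a 2 kb span.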



KTA Buffer                  per liter 

20 mM Tris 8.55             10 ml of 2 M 
10 mM BME                   0.7 ml neat 
10% w/v Glycerol            100 g 
0.1 mM EDTA                 0.2 ml of 0.5 M 
0.1% w/v THESIT             10 ml of 10% 
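
The stock volumes in the KTA recipe follow from the usual dilution relation (stock volume = final volume x final concentration / stock concentration). The sketch below checks them; the function name is hypothetical, and the concentration of neat 2-mercaptoethanol (about 14.3 M) is an assumed value that does not appear in the text.

```python
def stock_ml(final_conc: float, stock_conc: float,
             final_volume_ml: float = 1000.0) -> float:
    """Volume of stock needed; both concentrations must share one unit."""
    return final_volume_ml * final_conc / stock_conc


if __name__ == "__main__":
    print(stock_ml(20.0, 2000.0))               # Tris: 20 mM from 2 M
    print(stock_ml(0.1, 500.0))                 # EDTA: 0.1 mM from 0.5 M
    print(stock_ml(0.1, 10.0))                  # THESIT: 0.1% from 10%
    print(round(stock_ml(10.0, 14300.0), 2))    # BME: 10 mM from neat (assumed 14.3 M)
```

Glycerol is weighed directly: 10% w/v corresponds to 100 g per liter.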

Rough Incorporation Assay 

1X PC2 Buffer (20 mM Tris-HCl pH 8.55, 2.5 mM MgCl2, 
16 mM (NH4)2SO4, 100 ug/ml BSA) 

200-250 ug/ml activated salmon sperm DNA 

40 uM each dNTP + 10-50 uCi α-32P-dATP per ml 

To 25 ul assay mix on ice add 0.2 ul of enzyme 
fraction, undiluted, or diluted in 8 ul of 1X PC2 
buffer (or a 1/5 or 1/25 dilution thereof.) 
Prepare standard Klentaq or Amplitaq, zero enzyme 
and total input samples, also. Incubate 10 min. at 
72° C, then chill. Spot 5 or 8 ul onto filter 
paper and wash twice for 5-10 min. with 5% TCA, 1% 
PPi. If pieces of paper were used, count each using 
Cerenkov radiation or hand monitor. If a single 
piece of 3MM paper was used, autoradiograph for 
60'. 

PCR Assay to give 2 kb product. 

Make up 1 ml of PCR reaction containing 50 ng 
of plasmid pLc (a clone of an R color control cDNA 
from maize. PNAS 86:7092; Science 247:449), 200 
pmoles each of primers Lc5 (SEQ ID NO:11) and Lc3 
(SEQ ID NO:12), PC2 buffer and 200 uM dNTPs, but no 
enzyme. 

Distribute 100 ul into tube one, and 50 ul into 
the rest of 8-10 tubes. Add 1 ul of final pool of 
KlenTaq to tube one and mix. Then remove 50 ul to 
tube two and mix that, and so on down the series, 
which will then contain decreasing amounts of enzyme 
in two-fold steps. Cover each 50 ul reaction with a 
drop of mineral oil, spin, and PCR 16 cycles at 2' 
95° C, 2' 65° C, 5' 72° C. 

Final Bench-Strength KlenTaq-278 Enzyme 

Using 63% glycerol / KTA (.5% THESIT) buffer 
with 222 mM ammonium sulfate, dilute the pool 
conservatively so that 1/4 ul should easily catalyze 
the amplification of the 2 kb span by PCR. Do not 
decrease the ammonium sulfate concentration below 50 
mM. Store at -20° C. 

EXAMPLE 4 

A PCR amplification assay to produce 2 kb of 
DNA product was conducted using Thermus aquaticus 
DNA polymerase (AmpliTaq) and Klentaq-278. To test 
polymerase thermostability at elevated temperatures, 
the DNA denaturation step of the PCR amplification 
reactions was conducted for 2 min. at 97° C, 98° C 
and 99° C, respectively, using graduated 
concentrations of DNA polymerase. 

The amplification procedures used followed 
approximately the protocol for amplifying nucleic 
acid sequences outlined by Saiki et al., Science 
239:487, 1988. A 1 ml reaction mixture was prepared 
containing 100 ng of plasmid pLc, 200 pmoles each of 
primers Lc5 (SEQ ID NO:11) and Lc3 (SEQ ID NO:12), 
reaction buffer (20 mM Tris-HCl pH 8.55, 16 mM 
ammonium sulfate, 2.5 mM MgCl2 and 150 ug/ml BSA), 
200 uM dNTPs, but no enzyme. 100 ul of the reaction 
mixture was placed into tubes. Aliquots of AmpliTaq 
and Klentaq-278 were then added and 20 cycles of PCR 
were undertaken. 

A range of enzyme concentrations was used in order 
to be able to detect small effects on the effective PCR 
catalysis activity. The template was 10 ng of pLc (a 
clone of an R color control cDNA from maize. PNAS 
86:7092, Science 247:449). The primers were Lc5 (SEQ ID 
NO:11) and Lc3 (SEQ ID NO:12). 

As a result of this experiment it was seen that 
98° C was not detectably detrimental to KlenTaq-278, 
yet AT was nearly completely inactivated by this 
temperature. 

EXAMPLE 5 

Efficient and Accurate PCR Amplification of 
Long DNA Targets: (Part A) 

A preferred embodiment of the above formulation 
(designated KlenTaq-LA) is provided as follows: 
Starting with the purified enzymes in storage 
buffer, mix 1 ul of Pfu DNA polymerase at 2.5 u./ul 
with 64 ul of KlenTaq-278 at 25 u./ul. Store at 
-20° C. 
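
The units ratio of this mixture can be verified from the volumes and stock concentrations just given. The sketch below is illustrative only and the function name is hypothetical.

```python
def units_ratio(e1_ul: float, e1_units_per_ul: float,
                e2_ul: float, e2_units_per_ul: float) -> float:
    """Ratio of majority (E1) to minority (E2) DNA polymerase activity units."""
    return (e1_ul * e1_units_per_ul) / (e2_ul * e2_units_per_ul)


if __name__ == "__main__":
    # 64 ul KlenTaq-278 at 25 u./ul mixed with 1 ul Pfu at 2.5 u./ul
    print(units_ratio(64.0, 25.0, 1.0, 2.5))
```

With the stated stocks the mixture is 64:1 by volume and 640:1 by units, KlenTaq-278 to Pfu.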

Larger amounts of Pfu are detrimental to some 
PCR amplifications, perform equally for some, and 
are beneficial for some. For testing of the optimum 
level of Pfu, several reactions complete with 
KlenTaq-278 are aliquoted in the amounts, left to 
right, of 75 ul, 25 ul, 25 ul, and as many additional 
25 ul aliquots as desired. Then 3/8 ul of Pfu 
(equivalent to 0.5 ul per 100 ul; this is about 
the most that one would ever want) is added to the 
leftmost, 75 ul reaction and mixed. Serial, two-fold 
dilutions are then made as 25 ul + 25 ul left 
to right along the row of tubes, adding no Pfu to 
the last one, as a control of KlenTaq-278 alone. A 
reaction of 1/2 or 1 ul (per 100 ul) of Pfu alone 
should also be run. 
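
The Pfu levels produced by this titration can be tabulated as follows. This is a sketch under stated assumptions: the function name is hypothetical, the small volume of Pfu added is neglected, each transfer of 25 ul into 25 ul is taken to halve the Pfu level exactly, and nothing is transferred into the last tube.

```python
def pfu_titration(n_tubes: int,
                  pfu_ul_added: float = 0.375,
                  first_volume_ul: float = 75.0) -> list:
    """Pfu level in each tube, expressed as ul of Pfu per 100 ul of reaction."""
    if n_tubes < 2:
        raise ValueError("need at least one Pfu tube and one control tube")
    # 3/8 ul in 75 ul is equivalent to 0.5 ul per 100 ul
    top = pfu_ul_added * 100.0 / first_volume_ul
    levels = [top / (2 ** i) for i in range(n_tubes - 1)]
    levels.append(0.0)  # last tube: KlenTaq-278 alone
    return levels


if __name__ == "__main__":
    print(pfu_titration(5))
```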

Reaction buffer is PC2 as above, supplemented 
with 200 uM of each dNTP and 800 uM of MgCl2 (total 
Mg++ 3.3 mM), and per 100 ul of reaction volume, 20 
pmoles of each primer MBL (SEQ ID NO:7) and MBR (SEQ 
ID NO:8), and 30 ng of λplac5 intact phage. Per 100 
ul of reaction volume, 1 or 1/2 ul of KTLA are 
effective levels of enzyme. Suitable PCR cycling 
conditions are two-temperature: 20 seconds at 94° 
C, 11 minutes at 70° C, for 20 cycles. Alternate 
cycling conditions include two-temperature PCR with 
1 minute at 98° C and 10 minutes at 65° C. 10 to 16 
ul are loaded onto an agarose gel for product 
analysis by staining with ethidium bromide. See 
Figure 1 for other details and variations. The 
template was λplac5, which carries a portion of the 
lac operon region of the E. coli genome. Thirty ng 
of phage DNA were included in each 100 ul of 
reaction volume, introduced as intact phage 
particles. The primers are homologous to wild-type 
lambda DNA and amplify λ DNA, not the lac DNA. 
Primer MBL No. 8757 (5' nucleotide matches base pair 
27914 of λ DNA) is GCT TAT CTG CTT CTC ATA GAG TCT 
TGC (SEQ ID NO:7). Primer MBR No. 8835 (5' 
nucleotide matches bp 34570 of λ DNA) is ATA ACG ATC 
ATA TAC ATG GTT CTC TCC (SEQ ID NO:8). The size of 
the amplified product is therefore predicted to be 
6657 bp. 
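
The predicted product size follows from the two primer coordinates, counted inclusively. The sketch below is illustrative; the function name is hypothetical, and it assumes that each coordinate is the position of the 5' nucleotide of a primer, with the two primers on opposite strands.

```python
MBL = "GCT TAT CTG CTT CTC ATA GAG TCT TGC".replace(" ", "")  # SEQ ID NO:7
MBR = "ATA ACG ATC ATA TAC ATG GTT CTC TCC".replace(" ", "")  # SEQ ID NO:8


def amplicon_bp(left_5prime: int, right_5prime: int) -> int:
    """Inclusive length between the 5' ends of the two primers."""
    return right_5prime - left_5prime + 1


if __name__ == "__main__":
    print(len(MBL), len(MBR))         # both primers are 27 nt long
    print(amplicon_bp(27914, 34570))  # 6657
```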

As shown in Figures 1A and 1B, each DNA 
polymerase enzyme (KlenTaq-278 or Pfu) alone gives 
rise to a faint product band (except for some 
reactions, when Pfu alone does not work at all), but 
the combinations all give rise to product bands that 
are 20 to 50 times more intense than either enzyme 
can catalyze on its own. 

Figure 1C, second lane from the right, shows the 
surprising result of adding as little as 1/64 ul of Pfu 
to 1 ul of KlenTaq-278 (a units ratio of 1/640) . Not 
shown are data that as little as 1/200 ul (1/2000 in 
units) of Pfu contributed a noticeable improvement to the 
efficiency of this test amplification. 

Vent DNA polymerase required 10-fold higher 
amounts (yet still minority amounts) for similar 
functionality. 

An additional, beneficial, and unexpected 
attribute of the PCR reactions catalyzed by KlenTaq-LA 
was a phenomenal, never previously observed 
intensity and sharpness of the PCR product bands. 
In part, this increased yield is manifested by a 
dark area in the middle of the bands as 
photographed. This darker area in the ethidium 
fluorescence is believed to be due to UV absorbance 
by the outside portions of the band, reducing the 
potential UV-activated fluorescence. The system 
apparently allowed a much greater yield of product 
than did the prior art, which tended to create a 
broad smear of product, and increasing amounts of 
side product, when amplification was allowed to 
proceed to this extent. 

EXAMPLE 6 

Efficient and Accurate PCR Amplification of 
Long DNA Targets: (Part B) 

Efficient amplification of 8.4 kb, 12.5 kb, 15 
kb, and 18 kb was demonstrated by the experiment 
depicted in Figure 2. This experiment extended the 
demonstrated performance of a preferred 
embodiment of the invention, 1/640 KlenTaq-LA, even 
further. The amplification was highly successful 
for the size range 8.4 to 15 kb, detectably 
successful for 18 kb, but not successful for an 
attempted 19.7 kb. 

Eight different PCR reactions were run in this 
experiment, differing from each other in the 
template or amount of template or in the primer pair 
employed, as shown in the legend on Figure 2. Each 
reaction was divided 3 ways and cycled differently 
in parts A, B, and C. Between parts A and B, this 
experiment compared 20 cycles to 30 cycles at 94° 
denaturation phase. In parts B and C, this 
experiment compared 94° to 93° for 30 cycles. This 
experiment utilized 1.3 ul of Klentaq-LA (at a 
Klentaq-278/Pfu ratio of 640) per 100 ul of 
reaction. This may have been a little too much 
enzyme, since high enzyme has been associated in 
previous experiments with the catastrophic synthesis 
of product which cannot enter the gel, as occurred 
here for the reaction products in channels 2B and 
6C. At the current stage of development of long PCR 
using the invention, this poor outcome occurs about 
10% of the time. 

Comparing conditions B and C, it is apparent that 
somewhat lower denaturation temperature is desirable. 
This is consistent with similar experiments comparing 
time at 94° C, in which yield of long PCR products was 
found to be decreased as the denaturation time increased 
in the order 2, 20, 60, and 180 seconds at 94° C for the 
denaturation step of each cycle. These data indicate 
that there was at least one weak link, i.e. least 
thermostable component, in the reactions which is subject 
to inactivation at 94°. Since 94° is below the 
temperature known to damage the DNA polymerase activity 
and the DNA, it is believed that neither of these is the 
thermolabile element. In an alternative embodiment of 
this aspect of the invention Pfu DNA polymerase is 
replaced as the minority component with a more 
thermostable 3'-exonuclease of a DNA polymerase such as, 
but not limited to, that from the Archaebacterium strain 
ES4, which can grow at temperatures up to 114° C 
[Pledger, R.J. and Baross, J.A., J. Gen. Microbiol. 137 
(1991)], which maximum growth temperature exceeds that of 
the source of the Pfu DNA polymerase (103° C.; 
Blumentals, I.I. et al. (1990) Annals of the N.Y. Acad. 
Sci. 589:301-314.) 

In the experiment in Figure 2 the final intensity of 
the 15 kb band matched in only 20 cycles the yield 
obtained by Kainze et al. supra in 30 cycles for a band of 
similar size and from similar λ DNA template amounts. 
This was a measure of the improved efficiency provided by 
the invention, and the further result was that the yield 
catalyzed by the invention in 30 cycles greatly exceeded 
the yield reported by these authors for 30 cycles. 
Accurate quantitation has not yet been carried out to 
measure the efficiency of the two methods, but inspection 
of Figure 2 compared to the figure published by Kainze et 
al. shows a yield for the 15 kb fragment that is 
estimated to be some 100 times higher. This corresponds 
approximately to a doubled efficiency of PCR extension. 
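
One possible reading of this estimate, offered only as a sketch, treats the yield after n cycles as proportional to (1 + e)^n, where e is the fraction of templates fully extended in each cycle. This exponential model, the choice of 30 cycles as the basis of comparison, and the function names are assumptions that do not appear in the text.

```python
def per_cycle_factor_ratio(yield_ratio: float, cycles: int) -> float:
    """Ratio of per-cycle amplification factors implied by an overall yield ratio."""
    return yield_ratio ** (1.0 / cycles)


def baseline_efficiency_for_doubling(factor_ratio: float) -> float:
    """Baseline e for which (1 + 2e) = factor_ratio * (1 + e)."""
    return (factor_ratio - 1.0) / (2.0 - factor_ratio)


if __name__ == "__main__":
    r = per_cycle_factor_ratio(100.0, 30)
    e = baseline_efficiency_for_doubling(r)
    print(round(r, 3), round(e, 3), round(2 * e, 3))
```

On this model a 100-fold difference over 30 cycles corresponds to a per-cycle factor about 1.17 times larger, which amounts to a doubling of e if the per-cycle efficiency of the earlier method was about 0.2.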

EXAMPLE 7 

Efficient and Accurate PCR Amplification of 
Long DNA Targets: (Part C) 

Materials and Methods 

DNA Polymerases. DNA polymerases Vent and Deep Vent 
were supplied by New England Biolabs. Pfu DNA polymerase 
and its exo- mutant were supplied by Stratagene at 2.5 
units/ul. Klentaq-278 is an N-terminal deletion variant 
of Taq DNA polymerase as described above. Purified 
Klentaq-278 was as supplied by Ab Peptides, St. Louis, 
MO, USA at 25-35 units/ul (a protein concentration of 
about 0.7 ug/ul). One unit of DNA polymerase activity 
incorporates 10 nmoles of nucleotide in 30 min. at 72° 
C, utilizing activated (partially degraded) calf thymus 
DNA as template. Since activated calf thymus DNA is a 
somewhat undefined substrate and is structurally 
different from PCR reaction substrate, this assay was 
routinely eschewed in favor of a PCR-based assay to set 
the above stock concentration of Klentaq-278: the 
concentration of Klentaq-278 stock was adjusted so that 
0.25 ul effectively (but 0.12 ul less effectively) 
catalyzes the amplification of a 2 kb target span from 10 
ng of plasmid substrate with cycling conditions including 
7 min. of annealing/extension at 65°. The mixture of 
15/16 ul Klentaq-278 + 1/16 ul Pfu DNA polymerases is 
designated KlentaqLA-16. 

Agarose gel electrophoresis employed 0.7% to 1%
agarose in 1X GGB (TEA) buffer [40 mM Tris acetate pH 8.3,
20 mM sodium acetate, 0.2 mM EDTA] at 2-3 V/cm, with 3%
ficoll instead of glycerol in the loading dye. Figure 5
employed 1% agarose pulsed-field CHEF (11) with a
switching time of 4 sec. Standard DNA fragment sizes in
every figure are, in kilobases (kb): 23.1, 9.4, 6.6,
4.4, 2.3, 2.0, and 0.56. Figures 5 and 6 also have a
full-length λplac5 standard band, 48645 bp.

All agarose gels were run or stained in ethidium 
bromide at 0.5 ug/ml and photographed (35 mm ASA 400
black and white film) or videographed (Alpha Innotech or 
Stratagene Eagle Eye) under UV illumination. While 
printing the gel photographs, the left halves of Figures 
2 and 4 were exposed 50% less than the right halves. 

DNA primers are listed in Table 3 and in the 
Sequence Listing. 

Lambda DNA templates. λvacA, a gift from S.
Phadnis, is a λEMBL4-vectored clone of the cytotoxin gene
region of Helicobacter pylori DNA. This DNA was
extracted and stored frozen. The other phage template
DNAs, λplac5 (12) and λK138 (13), were added as intact
phage particles that had been purified by CsCl equilibrium
centrifugation, dialyzed, and diluted in 1X PC2 buffer.

Long and Accurate PCR. PC2 Reaction buffer (10)
consisted of 20 mM Tris-HCl pH 8.55 at 25°, 150 ug/ml
BSA, 16 mM (NH4)2SO4, 3.5 mM MgCl2, 250 uM each dNTP. For
success above 28 kb (at 35 kb), 1.5 ul of 2 M Tris base
was added to each reaction, corresponding to pH 9.1
measured for the Tris-HCl component only at 20 mM in
water at 25° C. Contact with a pH probe was detrimental
to the reactions, so pH was only measured on separate
aliquots, and found to be 8.76 in the final reaction at
25° C. Each 100 ul of reaction volume contained 20
pmoles of each primer, and 0.1 to 10 ng of phage DNA
template. 0.8 or 1.2 ul of KlentaqLA-16 was appropriate
for under 20 kb and over 20 kb, respectively. Reaction
volumes per tube were 33-50 ul, under 40 ul of mineral
oil in thin-walled (PGC or Stratagene) plastic test
tubes.
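
As a worked illustration of the quantities above, the following Python sketch scales the per-100-ul amounts to the volume of a single tube. The function name and dictionary keys are hypothetical. Two points are assumptions rather than statements of the text: that the 0.8 and 1.2 ul enzyme volumes are quoted per 100 ul of reaction, and that a target of exactly 20 kb is treated as "under 20 kb".

```python
def klentaq_la_setup(reaction_ul, target_kb, template_ng_per_100ul=1.0):
    """Scale the per-100-ul quantities given in the text to one tube.

    Illustrative sketch only; names and defaults are hypothetical.
    """
    if not 0.1 <= template_ng_per_100ul <= 10:
        raise ValueError("text gives 0.1 to 10 ng of phage DNA per 100 ul")
    scale = reaction_ul / 100.0
    # 0.8 ul for targets under 20 kb, 1.2 ul for targets over 20 kb.
    # Assumption: volumes are per 100 ul, and exactly 20 kb counts as "under".
    enzyme_per_100ul = 1.2 if target_kb > 20 else 0.8
    return {
        "primer_pmol_each": 20 * scale,
        "template_ng": template_ng_per_100ul * scale,
        "klentaq_la16_ul": enzyme_per_100ul * scale,
        "mineral_oil_ul": 40,  # overlay per tube, not scaled
    }


# Example: a 33 ul tube for the 35 kb target.
setup = klentaq_la_setup(reaction_ul=33, target_kb=35)
for name, amount in setup.items():
    print(f"{name}: {amount:.3g}")
```

In practice a 100 ul master mix would be assembled and split, as described below for comparisons of conditions, rather than pipetting sub-microliter enzyme volumes into each tube.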

PCR reactions utilizing the primers at the ends of λ
required a preincubation of 5 min. at 68°-72° to disrupt
the phage particles and to allow fill-in of the λ sticky
ends to complete the primer homology. Optimal cycling
conditions were in a multiple-block instrument
(RoboCycler, Stratagene) programmed per cycle to 30 sec. 99°,
30 sec. 67°, and 11 to 24 min. at 68°, depending on
target length over the range shown in Table 3. The
second-best cycler was the Omnigene (HybAid), programmed
under tube control per cycle to 2 sec. at 95°, then 68°
for similar annealing/extension times. Unless otherwise
stated, all of the experiments reported here used 24
cycles.
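
To give a sense of the run times these programs imply, the short Python sketch below sums the programmed hold times for 24 cycles of the RoboCycler program. The helper name is hypothetical, and the calculation ignores ramp and block-transfer time, so actual runs would be somewhat longer. The 5 min. preincubation is included here although the text requires it only for primers at the ends of λ.

```python
def run_time_minutes(cycles, steps_sec, preincubation_sec=0):
    """Total programmed hold time in minutes, ignoring ramp/transfer time."""
    return (preincubation_sec + cycles * sum(steps_sec)) / 60.0


# RoboCycler program from the text: 30 sec. 99°, 30 sec. 67°,
# then 11 to 24 min. at 68°, for 24 cycles.
short = run_time_minutes(24, [30, 30, 11 * 60], preincubation_sec=5 * 60)
long_ = run_time_minutes(24, [30, 30, 24 * 60], preincubation_sec=5 * 60)
print(f"{short / 60:.1f} h to {long_ / 60:.1f} h")  # prints: 4.9 h to 10.1 h
```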

For reported results of comparison of conditions 
such as cycling temperatures and times, thermal cycler 
machines, thick and thin-walled tubes, etc., reactions 
were made up as 100 ul complete and then split into 
identical aliquots of 33 ul before subjecting to PCR 
cycling. 

Table 3. Primer and template combinations.

Product    Left                  Right                 Template
Size       Primer                Primer                DNA

5.8        MBL101                MS1933                λK138
6657       MBL                   MBR                   λplac5
8386       MBL-1.7               MBR                   λplac5
8.7        MBR001                λR36                  λK138
12.1       lacZ333               MBR202                λK138
12.5       MBL 27mer or          MBR 27mer or          λvacA1
           MBL101 33mer          MBR202 33mer
15560      MSA19 28mer           MBR202                λplac5
           MSA1933 33mer
18.0       MBL101                MBR202                λK138
19.8       λL36                  MBL002                λK138
20707      MBL101                λR36 36mer            λplac5
19584      λL36                  lacZ333               λplac5
13971      MBR001 33mer          λR36                  λplac5
22.0       λL36                  lacZ'533              λK138
24.6       λL36                  MSA1933               λK138
22495      λL36                  lacZ536               λplac5
26194      lacZ533               λR36                  λplac5
28083      λL36                  MBL002                λplac5
34968      λL36                  MBR202                λplac5


Legend to Table 3.
Product sizes in integer base pairs are as predicted from the
sequence and structure of λ and λplac5 as documented in Genbank
accession no. J02459 and ref. (21). Product sizes with decimal
points in kb were determined by comparison with these products
and with the λ+HindIII size standards labelled λH3. The
sequence of the primers is given in the Sequence Listing.

Megaprimer consisted of gel-purified 384 bp PCR product
DNA homologous to the region between the BamHI site and EcoRI
site of the gene coding for the CryV ICP of Bacillus
thuringiensis (14), and primer-modified to remove these
restriction sites. The PCR reactions in Figure 4 each employed
megaprimer (300 ng), primer BtV5 and 20 ng of genomic DNA from
Bacillus thuringiensis strain NRD12 (15), and enzyme as
indicated in the description above of Figure 3. Cycling
conditions were 30 sec. 95°, 7 min. 60°, for 20 cycles.

HindIII digestion. Unfractionated, total PCR reactions
for 28 and 35 kb targets were supplemented with 1/10 volume of
10X NaTMS (1X = 50 mM NaCl, 10 mM Tris-HCl pH 7.7, 10 mM MgCl2,
10 mM mercaptoethanol) and 2 ul (10 units) of restriction enzyme
HindIII, and incubated at 55° C. for 90 min.

Test of exo- Pfu. Each 100 ul of reaction (incubated
as 33 ul under 40 ul of oil) contained 2 ng λplac5 DNA as
purified phage particles, 20 pmoles each of primers MBL-1.7 and
MBR, reaction buffer PC2 and 1 ul of Klentaq-278 (0.7 ug),
except for reaction 6, which contained 1 ul Pfu DNA polymerase
(2.5 u.) alone. Other details are in the description of Fig.
12. Thermal conditions were 24 cycles of 2 sec. at 94°, 11 min.
at 70°.

The discovery leading to the DNA polymerase mixture of
the present invention was made during attempts to utilize in PCR
a primer with a mismatched A-A base-pair at its 3' end. In fact
the primer was itself a PCR product "megaprimer" of 384 bp, and
the mismatched A had been added by Klentaq-278 using the
non-templated terminal transferase activity common to DNA
polymerases (16). Neither Klentaq-278 (Figure 3, lane 3) nor
Pfu DNA polymerase (Figure 3, lanes 1 & 2 and other levels of
enzyme not shown) could catalyze amplification of the 1500 bp
target that lay between the PCR-product megaprimer and a 42mer
oligonucleotide primer. The combination of the two enzymes,
however, was well able to catalyze amplification of the desired
target fragment (Fig 3, lane 4). Evidently, the Pfu DNA
polymerase removed the presumed 3' A-A mismatch, allowing
Klentaq-278 catalysis to proceed efficiently for each step of
the PCR. The same result was obtained with Vent DNA polymerase
substituted for Pfu (data not shown).

I hypothesized that mismatched 3'-ends are a general
cause of inefficient primer extension during PCR of targets
larger than a few kb. As a test system I employed a 6.6 kb
lambda DNA target which was amplified detectably but poorly by
AmpliTaq, Klentaq-278 or Pfu DNA polymerase in a variety of
standard conditions. Per 100 ul reaction volume, 1 ul of
Klentaq-278 was combined with various amounts of Pfu DNA
polymerase, from 1/2 ul down to as little as 1/200 ul of Pfu.
Since the Pfu stock (2.5 units/ul) was at least 10 times less
concentrated than the Klentaq-278 stock (25-30 units/ul), the
actual ratios tested were 1/20 to 1/2000 in DNA polymerase
units. Representative results of these tests are shown in
Figure 1B. A high yield of target band was observed for all
tested combinations of the two enzymes, yet several levels of
each enzyme on its own failed to catalyze more than faintly
detectable amplification. The lowest level of Pfu tested, 1/200
ul, exhibited only a slight beneficial effect. The apparent
broad optimum ratio of Klentaq-278:Pfu was 16 or 64 by volume,
which is about 160 or 640 on the basis of DNA polymerase
incorporation units. When tested at 6-8 kb (data not shown),
other combinations of 3'-exo- and 3'-exo+ thermostable DNA
polymerases also showed the effect, including Klentaq-278/Vent,
Klentaq5 (DeltaTaq, USB)/Pfu, Stoffel Fragment/Pfu,
Klentaq-278/Deep Vent (our co-favorite; 48:1 by volume, 720:1 by
unit), and Pfu exo-/Pfu exo+. Amplitaq/Pfu or Amplitaq/Pwo, at a
ratio of 25:1, are also very effective, but it is important that
the Mg++ be held to a level that is close to 0.75 mM over the
total level of dNTPs [for instance, 400 uM each dNTP, and 2.35 mM
MgCl2].
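
The two pieces of arithmetic in the preceding paragraph can be sketched in a few lines of Python. The function names are hypothetical. The Klentaq-278 stock is taken at 25 units/ul, the low end of the stated 25-30 units/ul range, which is an assumption made to reproduce the "about 160 or 640" figures; the Mg++ rule is read as "sum of the four dNTP concentrations plus 0.75 mM".

```python
def unit_ratio(volume_ratio, major_units_per_ul, minor_units_per_ul):
    """Convert an exo-:exo+ ratio by volume into a ratio by polymerase units."""
    return volume_ratio * major_units_per_ul / minor_units_per_ul


def mgcl2_mM(dntp_each_uM, excess_mM=0.75, n_dntps=4):
    """MgCl2 level held at `excess_mM` over the total dNTP concentration."""
    return n_dntps * dntp_each_uM / 1000.0 + excess_mM


KLENTAQ_U_PER_UL = 25.0  # assumption: low end of the stated 25-30 units/ul
PFU_U_PER_UL = 2.5       # stated Pfu stock concentration

for by_volume in (16, 64):
    by_units = unit_ratio(by_volume, KLENTAQ_U_PER_UL, PFU_U_PER_UL)
    print(f"{by_volume}:1 by volume is about {by_units:.0f}:1 by units")

print(f"400 uM each dNTP -> {mgcl2_mM(400):.2f} mM MgCl2")
```

The same conversion applied to the tested range of 1/2 to 1/200 ul of Pfu per ul of Klentaq-278 gives the 1/20 to 1/2000 unit ratios quoted above.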

A very short heat step is preferred. I next attempted
to amplify DNA in the size range 8.4 to 18 kb from lambda
transducing phage template. Our early cycling protocol employed
a denaturation step of 1 or 2 minutes at 95° or 98° C, but no
useful product in excess of 8.4 kb was obtained until the
parameters of this heat step were reduced to 2 sec. or 20 sec.
at 93° or 94° C. In an experiment with the denaturation step at
94° for 20, 60, or 180 sec, the 8.4 kb product exhibited
decreasing yield with increased length of this heat step (data
not shown). Apparently, a component of the reaction is at its
margin of thermostability. Figure 2 shows that, using the short
2 sec. denaturation step, target fragment was obtained for some
reactions at all sizes in the range 8.4 to 18 kb, with very high
product yields up to 15 kb if 30 PCR cycles were employed.
Figure 2 also shows some failed reactions which I cannot
explain. The failure mode that gives rise to massive ethidium
staining in the sample well (30-cycle lane 2) was particularly
common, especially at high enzyme levels.

Longer Primers. A change in primer length from 27 to
33 greatly reduced the frequency of failed reactions. Figure 4
demonstrates improved reliability for amplification of 12.5, 15
and 18 kb with the longer 33mer primers, under conditions of
otherwise optimally high enzyme levels in which the 27mer
primers failed to give rise to desirable target product. This
result does not represent an extensive survey of primer length,
and it has not yet been repeated with the improvements below.
Therefore the optimum primer length for long PCR remains to be
determined. Some of the amplifications analyzed in Figure 5
utilized 36mer primers from the very ends of λ. A 2-5 min.
preincubation at 68-72° (22) was necessary to release the
template DNA from the phage particles and to fill in the sticky
ends of lambda to complete the template homology with primers
λL36 and λR36.

Filtered Tips. For repeated experiments in the same
laboratory with the same primer sets, some sort of carry-over
product can contaminate the pipetter barrels and stock
solutions, and it is now believed that this is the main cause of
the failed reactions shown in Figure 4. The nature of the
carried-over product has not yet been determined, but it seems
to act as a "bad seed" to recruit good PCR product into the
intractable material that is at the wells and does not enter the
gel in the failed lanes of Figure 4.

This carry-over contamination problem is effectively
combatted by two measures: 1) Always use different pipets for
assembly (before cycling) and gel analysis (after cycling)
procedures. 2) Always use pipet tips with a filter in each
one, also known as aerosol resistant tips (ART).

When the above two measures are employed, 27mer
primers and primers as short as 23 base pairs often work well
for the long and accurate PCR. When compared directly, 33mer
primers continue to outperform 23mer primers, but the
difference is now slight (less than 3-fold improvement).

Rapid cycling. A change to thin-walled tubes, which
have lower heat capacity and conduct heat more efficiently,
further improved the reactions. Figure 5 shows a CHEF
pulse-field agarose gel analysis of successful amplifications of
DNA spans 6-26 kb in size. The target of 28 kb was not amplifiable
in the Omnigene thermal cycler (data not shown), but did appear
(Figure 6, lane 2) when the RoboCycler was employed.

Several models of thermal cycler have been employed,
and although not all have been optimized, some are preferable to
others for long PCR. As may be concluded from the advantage of
thin-walled tubes noted above, success seems to be positively
correlated with a high speed of temperature change made possible
by the design of the thermal cycler. The RoboCycler achieves
rapid temperature change by moving tubes from block to block,
and observations with a thermistor temperature probe indicate
that it raises the reactions to 93-95° for only 5 sec. under the
denaturation conditions employed (30 sec. in the 99° block),
before rapidly (within 30 sec) returning the reaction to 68°.

Higher pH. The current record 35 kb (Figure 6, lane 3)
was only amplifiable if the pH was increased. A preliminary
scan of higher pH was carried out (data not shown), and this
resulted in the appearance of the 35 kb band at pH 8.8 to 9.2,
with the optimum at 9.1 as described in Methods (above).
Further improvement to a high yield of the 35 kb product was
achieved by lengthening the extension time to 24 min. Other
than the higher pH, the long PCR procedure has not yet realized
any potential benefits from changes in buffer conditions from
those optimized for 8.4 kb. For targets over 20 kb, extension
times exceeding 20 min. are preferred and the extension
temperature is preferably below 69° C.

Identity of long PCR products. It can be seen in
Figures 2, 4 and 5 that the mobilities of the successful large
DNA products agree with those predicted in Table 3 from the
known map positions of the primers used.

HindIII restriction enzyme digestion of the unpurified
28 and 35 kb products (Figure 6, lanes 6 and 7) resulted in the
expected left arm of lambda (23 kb) and 2.3 kb band from both,
and the predictable bands terminated by the right PCR primer:
447 bp (barely visible) from the 28 kb product and 7331 bp from
the 35 kb product.

Exonuclease mutant. The available mutant of Pfu DNA
polymerase (8) which is defective in the 3'-exonuclease activity
was tested. Figure 7 shows that the 3'-exo- mutant of Pfu DNA
polymerase fails to promote efficient amplification of a long
DNA target. This supports our hypothesis that the
3'-exonuclease activity is important for the efficiency of PCR
amplification in this size range.

Fidelity test. Since the biological purpose of
3'-exonuclease is to edit base pair mismatches for high replication
fidelity, we tested the fidelity of the PCR product using an
assay involving the amplification and molecular cloning of an
entire lacZ (β-galactosidase) gene flanked by two selectable
markers (10). Heretofore the highest reported fidelity of PCR
amplification is that catalyzed by Pfu DNA polymerase (2).
Table 4 shows that the fidelity of the product amplified by the
640:1 mixture of Klentaq-278 and Pfu DNA polymerase at least
matches that obtained for Pfu DNA polymerase alone, when each
is used for 16 cycles of PCR. Our designation of the enzyme
mixture as Klentaq-LA (KlenTaq Long and Accurate) reflects this
high fidelity performance.

Table 4. Non-silent mutations introduced into the lacZ
gene by 16 cycles of PCR (10).

              LacZ+   LacZ-                   Effective   Errors     Fold improvement
Enzyme        Blue    Light Blue   % mutant   cycle no.   per 10^5   over full-length
                      or White                (c)         bp (b)     Taq

KTLA-64       571     34           5.6        12          1.05       12.7
Pfu           528     37           6.5        8           1.9        6.9
Klentaq5 (a)  442     85           16.1       8           5.1        2.6
Klentaq1      3225    985          26.4       8           9.0        1.5
Amplitaq      525     301          36.4       8           13.4       1.0


(a) Klentaq5 is the N-terminal deletion of Taq DNA polymerase
described in ref. 10. (b) Equation 1 of reference 10 was
rearranged to be as follows to solve for errors per bp:
X = -(ln(2F^(1/(m-1)) - 1))/1000, where X is the errors per bp
incorporated, 1000 is the effective target size in the lacZ gene
(10), F is the fraction of blue colonies, and m is the effective
cycle number. (c) As in ref. 10, the effective cycle number was
estimated at less than the machine cycles to reflect the actual
efficiency of the reaction, yet higher than the minimum
calculated from the fold-amplification. Strand loss due to
incomplete synthesis of product strands is a probable cause of
lower than ideal amplification efficiency. Therefore successful
(not lost) product molecules are judged to have undergone more
than the calculated minimum number of replications. KTLA-64
(Klentaq-278:Pfu::64:1 by volume) was assigned a higher
effective cycle number since its reactions started with 10 times
less DNA (1.5 ng vs. 15 ng plasmid pWB305) to result in
comparable levels of product.
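
As an illustration of footnote (b), the Python sketch below recomputes the error rates from the colony counts in Table 4. The function name is hypothetical. The exponent is read here as 1/(m-1); that reading is an assumption about the printed formula, adopted because it brings the computed values close to the tabulated errors per 10^5 bp for the rows shown.

```python
import math


def errors_per_bp(blue, mutant, effective_cycles, target_bp=1000):
    """Errors per bp from colony counts, per the rearranged equation in note (b).

    F is the fraction of blue colonies and m the effective cycle number.
    Assumption: the exponent on F is 1/(m - 1).
    """
    f = blue / (blue + mutant)
    return -math.log(2 * f ** (1.0 / (effective_cycles - 1)) - 1) / target_bp


# (blue colonies, light blue or white colonies, effective cycle no.) from Table 4
counts = {
    "KTLA-64": (571, 34, 12),
    "Pfu": (528, 37, 8),
    "Amplitaq": (525, 301, 8),
}
rates = {name: errors_per_bp(*row) * 1e5 for name, row in counts.items()}
for name, rate in rates.items():
    fold = rates["Amplitaq"] / rate
    print(f"{name}: {rate:.2f} errors per 10^5 bp, {fold:.1f}-fold vs Amplitaq")
```

The fold-improvement column is taken here to be the Amplitaq error rate divided by the error rate of each enzyme, which is why Amplitaq itself scores 1.0.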

DISCUSSION

The previous length limitation for PCR amplification
is postulated to have been caused by low efficiency of extension
at the sites of incorporation of mismatched base pairs.
Although it would have seemed that the cure for these mismatches
would be to employ enzymes with 3'-(editing)-exonucleases, I
believe that when Pfu and Vent DNA polymerase are used to
catalyze our amplifications on their own, their failure is due
to degradation of the PCR primers by their 3'-exonucleases,
especially during the required long synthesis times and at
optimally high DNA polymerase levels. Evidently, low levels of
3'-exonuclease are sufficient and optimal for removal of the
mismatches to allow Klentaq-278 and the amplification to
proceed. It has been demonstrated that the optimally low level
of 3'-exonuclease can be set effectively, conveniently, and
flexibly by mixing and dilution.

Preferably the ratio of exo-/exo+ enzyme is high. If
equal levels of the two types of enzymes are used (or where the
E2 component is in excess), and in many embodiments tested,
where the ratio of exo-/exo+ is 4 or less, the effectiveness of
the long PCR, even under optimal cycling conditions discussed
below, is non-existent or much reduced.

It is preferred, and for certain applications
important, that the length and temperature of the heat
denaturation step of the PCR be kept to a minimum. Further, the
improvement obtained by increasing the pH slightly may
correspond to a decrease in template depurination. If so,
further improvements may result if depurination can be reduced,
or if a majority DNA polymerase component can be found which is
able to bypass depurination sites.

The short denaturation time found to be optimal,
preferably less than 20 sec, and most preferably, 5 sec. or
less in the reaction itself at 95°, is surprisingly effective
for the amplification of 35 kb, whereas it might have been
expected that longer PCR targets would need longer denaturation
time to become completely denatured. If complete denaturation
is required for PCR, and if longer DNA requires more time to
unwind at 95°, the required unwinding time may eventually become
significantly more than 5 seconds. This could limit the size of
amplifiable product because of the increased depurination caused
by longer denaturation times.

These amplifications were successful with several
different target sequences, with several primer combinations,
and with product sizes up to nearly twice the maximum size of
inserts cloned into λ. Whole viruses and plasmids up to 35 kb
in length should now be amplifiable with this system. Should
this method prove applicable to DNA of higher complexity than λ,
it could prove a boon to genomic mapping and sequencing
applications, since in vitro amplification is convenient and
avoids the DNA rearrangement and gene toxicity pitfalls of in
vivo cloning.

In view of the above, it will be seen that the several 
objects of the invention are achieved and other advantageous 
results attained. 

As various changes could be made in the above methods 
and products without departing from the scope of the invention, 
it is intended that all matter contained in the above 
description shall be interpreted as illustrative and not in a 
limiting sense. 

SEQUENCE LISTING



(1) GENERAL INFORMATION: 



(i) APPLICANT: Barnes Ph.D., Wayne M 

(ii) TITLE OF INVENTION: DNA polymerases with 
enhanced length and 
efficiency of primer extension 

(iii) NUMBER OF SEQUENCES: 29



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Senniger, Powers, Leavitt & Roedel 

(B) STREET: One Metropolitan Square 

(C) CITY: St. Louis 

(D) STATE: Missouri 

(E) COUNTRY: USA 

(F) ZIP: 63102 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER : 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Blosser, G.Harley 

(B) REGISTRATION NUMBER: 33,650 

(C) REFERENCE/DOCKET NUMBER: WNB4912 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (314) 231-5400 

(B) TELEFAX: (314) 231-4342 

(C) TELEX: 6502697583 MCI 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thermus aquaticus 

(B) STRAIN: YT1 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY : synthetic 

(B) CLONE: KT1 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 6 . . 35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

GAGCC ATG GGC CTC CTC CAC GAG TTC GGC CTT CTG G          36
      Met Gly Leu Leu His Glu Phe Gly Leu Leu
        1               5                  10



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Gly Leu Leu His Glu Phe Gly Leu Leu
  1               5                  10

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: YES 

(v) FRAGMENT TYPE: C-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thermus aquaticus 

(B) STRAIN: YT1 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: synthetic 

(B) CLONE: Klentaq32 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: complement (8..34)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GCGAAGCTTA CTACTCCTTG GCGGAGAGCC AGTCC                   35



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Asp Trp Leu Ser Ala Lys Glu * * 

1 5 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6714 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Expression vector 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: pWB254b 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1665



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

ATG GGC CTC CTC CAC GAG TTC GGC CTT CTG GAA AGC CCC AAG GCC CTG 
Met Gly Leu Leu His Glu Phe Gly Leu Leu Glu Ser Pro Lys Ala Leu 
1 5 10 15

GAG GAG GCC CCC TGG CCC CCG CCG GAA GGG GCC TTC GTG GGC TTT GTG 
Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe Val
20 25 30 

CTT TCC CGC AAG GAG CCC ATG TGG GCC GAT CTT CTG GCC CTG GCC GCC 
Leu Ser Arg Lys Glu Pro Met Trp Ala Asp Leu Leu Ala Leu Ala Ala 
35 40 45 

GCC AGG GGG GGC CGG GTC CAC CGG GCC CCC GAG CCT TAT AAA GCC CTC 
Ala Arg Gly Gly Arg Val His Arg Ala Pro Glu Pro Tyr Lys Ala Leu 
50 55 60 

AGG GAC CTG AAG GAG GCG CGG GGG CTT CTC GCC AAA GAC CTG AGC GTT 
Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu Ala Lys Asp Leu Ser Val 
65 70 75 80 

CTG GCC CTG AGG GAA GGC CTT GGC CTC CCG CCC GGC GAC GAC CCC ATG 
Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro Pro Gly Asp Asp Pro Met 
85 90 95 

CTC CTC GCC TAC CTC CTG GAC CCT TCC AAC ACC ACC CCC GAG GGG GTG 
Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly Val 
100 105 110 

GCC CGG CGC TAC GGC GGG GAG TGG ACG GAG GAG GCG GGG GAG CGG GCC 
Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala 
115 120 125 

GCC CTT TCC GAG AGG CTC TTC GCC AAC CTG TGG GGG AGG CTT GAG GGG
Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu Trp Gly Arg Leu Glu Gly
130 135 140

GAG GAG AGG CTC CTT TGG CTT TAC CGG GAG GTG GAG AGG CCC CTT TCC
Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser
145 150 155 160

GCT GTC CTG GCC CAC ATG GAG GCC ACG GGG GTG CGC CTG GAC GTG GCC
Ala Val Leu Ala His Met Glu Ala Thr Gly Val Arg Leu Asp Val Ala
165 170 175

TAT CTC AGG GCC TTG TCC CTG GAG GTG GCC GAG GAG ATC GCC CGC CTC
Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala Glu Glu Ile Ala Arg Leu
180 185 190

GAG GCC GAG GTC TTC CGC CTG GCC GGC CAC CCC TTC AAC CTC AAC TCC
Glu Ala Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser
195 200 205

CGG GAC CAG CTG GAA AGG GTC CTC TTT GAC GAG CTA GGG CTT CCC GCC
Arg Asp Gln Leu Glu Arg Val Leu Phe Asp Glu Leu Gly Leu Pro Ala
210 215 220

ATC GGC AAG ACG GAG AAG ACC GGC AAG CGC TCC ACC AGC GCC GCC GTC
Ile Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala Val
225 230 235 240

CTG GAG GCC CTC CGC GAG GCC CAC CCC ATC GTG GAG AAG ATC CTG CAG
Leu Glu Ala Leu Arg Glu Ala His Pro Ile Val Glu Lys Ile Leu Gln
245 250 255

TAC CGG GAG CTC ACC AAG CTG AAG AGC ACC TAC ATT GAC CCC TTG CCG
Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr Tyr Ile Asp Pro Leu Pro
260 265 270

GAC CTC ATC CAC CCC AGG ACG GGC CGC CTC CAC ACC CGC TTC AAC CAG
Asp Leu Ile His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gln
275 280 285

ACG GCC ACG GCC ACG GGC AGG CTA AGT AGC TCC GAT CCC AAC CTC CAG
Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gln
290 295 300

AAC ATC CCC GTC CGC ACC CCG CTT GGG CAG AGG ATC CGC CGG GCC TTC
Asn Ile Pro Val Arg Thr Pro Leu Gly Gln Arg Ile Arg Arg Ala Phe
305 310 315 320

ATC GCC GAG GAG GGG TGG CTA TTG GTG GCC CTG GAC TAT AGC CAG ATA
Ile Ala Glu Glu Gly Trp Leu Leu Val Ala Leu Asp Tyr Ser Gln Ile
325 330 335

GAG CTC AGG GTG CTG GCC CAC CTC TCC GGC GAC GAG AAC CTG ATC CGG
Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu Ile Arg
340 345 350

GTC TTC CAG GAG GGG CGG GAC ATC CAC ACG GAG ACC GCC AGC TGG ATG
Val Phe Gln Glu Gly Arg Asp Ile His Thr Glu Thr Ala Ser Trp Met
355 360 365

TTC GGC GTC CCC CGG GAG GCC GTG GAC CCC CTG ATG CGC CGG GCG GCC
Phe Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala
370 375 380

AAG ACC ATC AAC TTC GGG GTC CTC TAC GGC ATG TCG GCC CAC CGC CTC
Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu
385 390 395 400

TCC CAG GAG CTA GCC ATC CCT TAC GAG GAG GCC CAG GCC TTC ATT GAG
Ser Gln Glu Leu Ala Ile Pro Tyr Glu Glu Ala Gln Ala Phe Ile Glu
405 410 415

CGC TAC TTT CAG AGC TTC CCC AAG GTG CGG GCC TGG ATT GAG AAG ACC
Arg Tyr Phe Gln Ser Phe Pro Lys Val Arg Ala Trp Ile Glu Lys Thr
420 425 430

CTG GAG GAG GGC AGG AGG CGG GGG TAC GTG GAG ACC CTC TTC GGC CGC
Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg
435 440 445

CGC CGC TAC GTG CCA GAC CTA GAG GCC CGG GTG AAG AGC GTG CGG GAG
Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg Val Lys Ser Val Arg Glu
450 455 460

GCG GCC GAG CGC ATG GCC TTC AAC ATG CCC GTC CAG GGC ACC GCC GCC
Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gln Gly Thr Ala Ala
465 470 475 480

GAC CTC ATG AAG CTG GCT ATG GTG AAG CTC TTC CCC AGG CTG GAG GAA
Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Glu Glu
485 490 495

ATG GGG GCC AGG ATG CTC CTT CAG GTC CAC GAC GAG CTG GTC CTC GAG
Met Gly Ala Arg Met Leu Leu Gln Val His Asp Glu Leu Val Leu Glu
500 505 510

GCC CCA AAA GAG AGG GCG GAG GCC GTG GCC CGG CTG GCC AAG GAG GTC
Ala Pro Lys Glu Arg Ala Glu Ala Val Ala Arg Leu Ala Lys Glu Val
515 520 525

ATG GAG GGG GTG TAT CCC CTG GCC GTG CCC CTG GAG GTG GAG GTG GGG
Met Glu Gly Val Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly
530 535 540

ATA GGG GAG GAC TGG CTC TCC GCC AAG GAG TAGTAAGCTT ATCGATGATA 1682
Ile Gly Glu Asp Trp Leu Ser Ala Lys Glu
545 550 555

AGCTGTCAAA CATGAGAATT AGCCCGCCTA ATGAGCGGGC TTTTTTTTAA TTCTTGAAGA 1742 

CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT 1802 

TAGCGTCAAA GCAACCATAG TACGCGCCCT GTAGCGGCGC ATTAAGCGCG CCGGGTGTGG 1862 

TGGTTACGCG CAGCGTGACC GCTACACTTG CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT 1922 

TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG GCTTTCCCCG TCAAGCTCTA AATCGGGGGC 1982 

TCCCTTTAGG GTTCCGATTT AGTGCTTTAC GGCACCTCGA CCCCAAAAAA CTTGATTTGG 2042

GTGATGGTTC ACGTAGTGGG CCATCGCCCT GATAGACGGT TTTTCGCCCT TTGACGTTGG 2102 

AGTCCACGTT CTTTAATAGT GGACTCTTGT TCCAAACTTG AACAACACTC AACCCTATCT 2162 

CGGGCTATTC TTTTGATTTA TAAGGGATTT TGCCGATTTC GGCCTATTGG TTAAAAAATG 2222 

AGCTGATTTA ACAAAAATTT AACGCGAATT TTAACAAAAT ATTAACGTTT ACAATTTCAG 2282

GTGGCACTTT TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT 2342

CAAATATGTA TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA 2402

GGAAGAGTAT GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT 2462 

GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT 2522

TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT 2582 

TTCGCCCCGA AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG 2642 

TATTATCCCG TGTTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA 2702

ATGACTTGGT TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA 2762

GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA 2822

CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA 2882

CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT ACCAAACGAC GAGCGTGACA 2942

CCACGATGCC TGCAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA 3002

CTCTAGCTTC CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC 3062

TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC 3122 

GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG 3182 

TTATCTACAC GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA 3242 

TAGGTGCCTC ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT 3302 

AGATTGATTT AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA 3362 

ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG 3422 

AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA 3482 

CAAAAAAACC ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT 3542 

TTCCGAAGGT AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTCCTT CTAGTGTAGC 3602 

CGTAGTTAGG CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA 3662 

TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA 3722 

GACGATAGTT ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC 3782 

CCAGCTTGGA GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CTATGAGAAA 3842 

GCGCCACGCT TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA 3902 

CAGGAGAGCG CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG 3962 

GGTTTCGCCA CCTCTGACTT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG GGGCGGAGCC 4022 

TATGGAAAAA CGCCAGCAAC GCGGCCTTTT TACGGTTCCT GGCCTTTTGC TGGCCTTTTG 4082 

CTCACATGTT CTTTCCTGCG TTATCCCCTG ATTCTGTGGA TAACCGTATT ACCGCCTTTG 4142 

AGTGAGCTGA TACCGCTCGC CGCAGCCGAA CGACCGAGCG CAGCGAGTCA GTGAGCGAGG 4202 

AAGCGGAAGA GCGCCTGATG CGGTATTTTC TCCTTACGCA TCTGTGCGGT ATTTCACACC 4262 

GCATATGGTG CACTCTCAGT ACAATCTGCT CTGATGCCGC ATAGTTAAGC CAGTATACAC 4322 

TCCGCTATCG CTACGTGACT GGGTCATGGC TGCGCCCCGA CACCCGCCAA CACCCGCTGA 4382 

CGCGCCCTGA CGGGCTTGTC TGCTCCCGGC ATCCGCTTAC AGACAAGCTG TGACCGTCTC 4442 

CGGGAGCTGC ATGTGTCAGA GGTTTTCACC GTCATCACCG AAACGCGCGA GGCAGAACGC 4502 

CATCAAAAAT AATTCGCGTC TGGCCTTCCT GTAGCCAGCT TTCATCAACA TTAAATGTGA 4562 

GCGAGTAACA ACCCGTCGGA TTCTCCGTGG GAACAAACGG CGGATTGACC GTAATGGGAT 4622 

AGGTTACGTT GGTGTAGATG GGCGCATCGT AACCGTGCAT CTGCCAGTTT GAGGGGACGA 4682 

CGACAGTATC GGCCTCAGGA AGATCGCACT CCAGCCAGCT TTCCGGCACC GCTTCTGGTG 4742 

CCGGAAACCA GGCAAAGCGC CATTCGCCAT TCAGGCTGCG CAACTGTTGG GAAGGGCGAT 4802 

CGGTGCGGGC CTCTTCGCTA TTACGCCAGC TGGCGAAAGG GGGATGTGCT GCAAGGCGAT 4862 

TAAGTTGGGT AACGCCAGGG TTTTCCCAGT CACGACGTTG TAAAACGACG GCCAGTGAAT 4922 

CCGTAATCAT GGTCATAGCT GTTTCCTGTG TGAAATTGTT ATCCGCTCAC AATTCCACAC 4982 

AACATACGAG CCGGAAGCAT AAAGTGTAAA GCCTGGGGTG CCTAATGAGT GAGCTAACTC 5042 

ACATTAATTG CGTTGCGCTC ACTGCCCGCT TTCCAGTCGG GAAACCTGTC GTGCCAGCTG 5102 

CATTAATGAA TCGGCCAACG CGCGGGGAGA GGCGGTTTGC GTATTGGGCG CCAGGGTGGT 5162 

TTTTCTTTTC ACCAGTGAGA CGGGCAACAG CTGATTGCCC TTCACCGCCT GGCCCTGAGA 5222 

GAGTTGCAGC AAGCGGTCCA CGCTGGTTTG CCCCAGCAGG CGAAAATCCT GTTTGATGGT 5282 

GGTTGACGGC GGGATATAAC ATGAGCTGTC TTCGGTATCG TCGTATCCCA CTACCGAGAT 5342 

ATCCGCACCA ACGCGCAGCC CGGACTCGGT AATGGCGCGC ATTGCGCCCA GCGCCATCTG 5402 

ATCGTTGGCA ACCAGCATCG CAGTGGGAAC GATGCCCTCA TTCAGCATTT GCATGGTTTG 5462 

TTGAAAACCG GACATGGCAC TCCAGTCGCC TTCCCGTTCC GCTATCGGCT GAATTTGATT 5522 

GCGAGTGAGA TATTTATGCC AGCCAGCCAG ACGCAGACGC GCCGAGACAG AACTTAATGG 5582 

GCCCGCTAAC AGCGCGATTT GCTGGTGACC CAATGCGACC AGATGCTCCA CGCCCAGTCG 5642 

CGTACCGTCT TCATGGGAGA AAATAATACT GTTGATGGGT GTCTGGTCAG AGACATCAAG 5702 

AAATAACGCC GGAACATTAG TGCAGGCAGC TTCCACAGCA ATGGCATCCT GGTCATCCAG 5762 

CGGATAGTTA ATGATCAGCC CACTGACGCG TTGCGCGAGA AGATTGTGCA CCGCCGCTTT 5822 

ACAGGCTTCG ACGCCGCTTC GTTCTACCAT CGACACCACC ACGCTGGCAC CCAGTTGATC 5882 

GGCGCGAGAT TTAATCGCCG CGACAATTTG CGACGGCGCG TGCAGGGCCA GACTGGAGGT 5942 

GGCAACGCCA ATCAGCAACG ACTGTTTGCC CGCCAGTTGT TGTGCCACGC GGTTGGGAAT 6002 

GTAATTCAGC TCCGCCATCG CCGCTTCCAC TTTTTCCCGC GTTTTCGCAG AAACGTGGCT 6062 

GGCCTGGTTC ACCACGCGGG AAACGGTCTG ATAAGAGACA CCGGCATACT CTGCGACATC 6122 

GTATAACGTT ACTGGTTTCA CATTCACCAC CCTGAATTGA CTCTCTTCCG GGCGCTATCA 6182 

TGCCATACCG CGAAAGGTTT TGCGCCATTC GATGGTGTCC CAGTGAATCC GTAATCATGG 6242 

TCATAGCTGT TTCCTGTGTG AAATTGTTAT CCGCTCACAA TTCCACACAT TATACGAGCC 6302 

GGAAGCATAA AGTGTAAAGC CTGGGGTGCC TAATGAGTGA GCTAACTCAC ATTAATTGCG 6362 

TTGCGCTCAC TGCCCGCTTT CCAGTCGGGA AACCTGTCGT GCCAGCTGCA TTAATGAATC 6422 

GGAGCTTACT CCCCATCCCC CTGTTGACAA TTAATCATCG GCTCGTATAA TGTGTGGAAT 6482 

TGTGAGCGGA TAACAATTTC ACACAGGAAA CAGGATCGAT CCAGCTTACT CCCCATCCCC 6542 

CTGTTGACAA TTAATCATCG GCTCGTATAA TGTGTGGAAT TGTGAGCGGA TAACAATTTC 6602 

ACACAGGAAA CAGGATCTGG GCCCTTCGAA ATTAATACGA CTCACTATAG GGAGACCACA 6662 

ACGGTTTCCC TCTAGAAATA ATTTTGTTTA ACTTTAAGAA GGAGATATAT CC 6714 
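
As a consistency check on the coding region, the codon rows near the end of the open reading frame can be translated with the standard genetic code and compared with the residue rows printed beneath them. The following is a minimal Python sketch under stated assumptions: the helper name `translate` and the compact codon-table encoding are illustrative choices, not part of the listing, and the codon row is transcribed from the row numbered 515 to 525 above.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# for each of the three positions (assumed encoding, for illustration).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter amino acid codes, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino = CODON_TABLE[seq[pos:pos + 3]]
        if amino == "*":  # stop codon ends the reading frame
            break
        residues.append(THREE_LETTER[amino])
    return " ".join(residues)


row_513 = "GCC CCA AAA GAG AGG GCG GAG GCC GTG GCC CGG CTG GCC AAG GAG GTC"
print(translate(row_513))
```

Comparing the printed residues with the residue row in the listing is a simple way to catch transcription errors such as `lie` for `Ile` or `Gin` for `Gln`.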

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Gly Leu Leu His Glu Phe Gly Leu Leu Glu Ser Pro Lys Ala Leu 
1 5 10 15 

Glu Glu Ala Pro Trp Pro Pro Pro Glu Gly Ala Phe Val Gly Phe Val 
20 25 30 

Leu Ser Arg Lys Glu Pro Met Trp Ala Asp Leu Leu Ala Leu Ala Ala 
35 40 45 

Ala Arg Gly Gly Arg Val His Arg Ala Pro Glu Pro Tyr Lys Ala Leu 
50 55 60 

Arg Asp Leu Lys Glu Ala Arg Gly Leu Leu Ala Lys Asp Leu Ser Val 
65 70 75 80 

Leu Ala Leu Arg Glu Gly Leu Gly Leu Pro Pro Gly Asp Asp Pro Met 
85 90 95 

Leu Leu Ala Tyr Leu Leu Asp Pro Ser Asn Thr Thr Pro Glu Gly Val 
100 105 110 

Ala Arg Arg Tyr Gly Gly Glu Trp Thr Glu Glu Ala Gly Glu Arg Ala 
115 120 125 

Ala Leu Ser Glu Arg Leu Phe Ala Asn Leu Trp Gly Arg Leu Glu Gly 
130 135 140 

Glu Glu Arg Leu Leu Trp Leu Tyr Arg Glu Val Glu Arg Pro Leu Ser 
145 150 155 160 

Ala Val Leu Ala His Met Glu Ala Thr Gly Val Arg Leu Asp Val Ala 
165 170 175 

Tyr Leu Arg Ala Leu Ser Leu Glu Val Ala Glu Glu Ile Ala Arg Leu 
180 185 190 

Glu Ala Glu Val Phe Arg Leu Ala Gly His Pro Phe Asn Leu Asn Ser 
195 200 205 

Arg Asp Gln Leu Glu Arg Val Leu Phe Asp Glu Leu Gly Leu Pro Ala 
210 215 220 

Ile Gly Lys Thr Glu Lys Thr Gly Lys Arg Ser Thr Ser Ala Ala Val 
225 230 235 240 

Leu Glu Ala Leu Arg Glu Ala His Pro Ile Val Glu Lys Ile Leu Gln 
245 250 255 

Tyr Arg Glu Leu Thr Lys Leu Lys Ser Thr Tyr Ile Asp Pro Leu Pro 
260 265 270 

Asp Leu Ile His Pro Arg Thr Gly Arg Leu His Thr Arg Phe Asn Gln 
275 280 285 

Thr Ala Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp Pro Asn Leu Gln 
290 295 300 

Asn Ile Pro Val Arg Thr Pro Leu Gly Gln Arg Ile Arg Arg Ala Phe 
305 310 315 320 

Ile Ala Glu Glu Gly Trp Leu Leu Val Ala Leu Asp Tyr Ser Gln Ile 
325 330 335 

Glu Leu Arg Val Leu Ala His Leu Ser Gly Asp Glu Asn Leu Ile Arg 
340 345 350 

Val Phe Gln Glu Gly Arg Asp Ile His Thr Glu Thr Ala Ser Trp Met 
355 360 365 

Phe Gly Val Pro Arg Glu Ala Val Asp Pro Leu Met Arg Arg Ala Ala 
370 375 380 

Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly Met Ser Ala His Arg Leu 
385 390 395 400 

Ser Gln Glu Leu Ala Ile Pro Tyr Glu Glu Ala Gln Ala Phe Ile Glu 
405 410 415 

Arg Tyr Phe Gln Ser Phe Pro Lys Val Arg Ala Trp Ile Glu Lys Thr 
420 425 430 

Leu Glu Glu Gly Arg Arg Arg Gly Tyr Val Glu Thr Leu Phe Gly Arg 
435 440 445 

Arg Arg Tyr Val Pro Asp Leu Glu Ala Arg Val Lys Ser Val Arg Glu 
450 455 460 

Ala Ala Glu Arg Met Ala Phe Asn Met Pro Val Gln Gly Thr Ala Ala 
465 470 475 480 

Asp Leu Met Lys Leu Ala Met Val Lys Leu Phe Pro Arg Leu Glu Glu 
485 490 495 

Met Gly Ala Arg Met Leu Leu Gln Val His Asp Glu Leu Val Leu Glu 
500 505 510 

Ala Pro Lys Glu Arg Ala Glu Ala Val Ala Arg Leu Ala Lys Glu Val 
515 520 525 

Met Glu Gly Val Tyr Pro Leu Ala Val Pro Leu Glu Val Glu Val Gly 
530 535 540 

Ile Gly Glu Asp Trp Leu Ser Ala Lys Glu 
545 550 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bacteriophage lambda 

(B) STRAIN: PaPa 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: MBL 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: 27940 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GCTTATCTGC TTCTCATAGA GTCTTGC 
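
Each entry pairs a stated LENGTH with the sequence itself, so the two can be cross-checked mechanically. The following is a minimal Python sketch; the function name `check_entry` and the `ENTRIES` dictionary layout are assumptions made for illustration, and the sequences are transcribed from SEQ ID NOs: 7, 8 and 9 of this listing.

```python
def check_entry(stated_length: int, sequence: str) -> bool:
    """Return True if the sequence contains only A, C, G, T and has the stated length.

    Whitespace (the 10-base grouping used in the listing) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    return set(bases) <= set("ACGT") and len(bases) == stated_length


# SEQ ID NO -> (stated LENGTH in base pairs, sequence as printed)
ENTRIES = {
    7: (27, "GCTTATCTGC TTCTCATAGA GTCTTGC"),
    8: (27, "ATAACGATCA TATACATGGT TCTCTCC"),
    9: (27, "TTTTGCTGGG TCAGGTTGTT CTTTAGG"),
}

for seq_id, (length, sequence) in ENTRIES.items():
    status = "ok" if check_entry(length, sequence) else "MISMATCH"
    print(f"SEQ ID NO: {seq_id}: {status}")
```

A check of this kind flags both length mismatches and stray non-base characters of the sort that scanning can introduce.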
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bacteriophage lambda 

(B) STRAIN: PaPa 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: MBR 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
ATAACGATCA TATACATGGT TCTCTCC 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bacteriophage lambda 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: MBL-1.7 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TTTTGCTGGG TCAGGTTGTT CTTTAGG 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(v) FRAGMENT TYPE: C-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: E.coli 

(B) STRAIN: K12 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: MSA19 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: lacZ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGAAGCTTAT TTTTGACACC AGACCAAC 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zea mays 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Lc5 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: 5' end of color control gene Lc 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GTGATGGATC CTTCAGCTTC CCGAGTTCAG CAGGCGG 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iv) ANTI-SENSE: YES 

(v) FRAGMENT TYPE: C-terminal 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Zea mays 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: Lc3 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: 3' end of color control gene Lc 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGTCTCGAGC GAAGCTTCCC TATAGCTTTG CGAAGAG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thermus aquaticus 

(B) STRAIN: YT1 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: KT2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GAGCCATGGC CAACCTGTGG GGGAGGCTTG AGGGGGA 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thermus aquaticus 

(B) STRAIN: YT1 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Genbank Accession No. J04639 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT: DNA polymerase gene 

(B) MAP POSITION: 950 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AGTTTGGCAG CCTCCTCCAC GAGTTCGGCC TTCTGG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thermus aquaticus 

(B) STRAIN: YT1 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Genbank Accession No. J04639 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT: DNA polymerase gene 

(B) MAP POSITION: 2595 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GGACTGGCTC TCCGCCAAGG AGTGATACCA CC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thermus flavus 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Genbank Accession No. X66105 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT: DNA polymerase 
(B) MAP POSITION: 1378 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AGTTTGGAAG CCTCCTCCAC GAGTTCGGCC TCCTGG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thermus flavus 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Genbank Accession No. X66105 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT: DNA polymerase gene 

(B) MAP POSITION: 3023 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGACTGGCTC TCCGCCAAGG AGTAGGGGGG TCCTG 
(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iv) ANTI-SENSE: YES 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: CryV 

(C) INDIVIDUAL ISOLATE: NRD12 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: BtV3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCGAAGCTTC TCGAGTTACG CTCAATATGG AGTTGCTTC 39 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: NRD12 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: BtV5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CCGAGATCTC CATGGATCCA AAGAATCAAG ATAAGCATCA AAG 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bacteriophage lambda 

(B) STRAIN: PaPa 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: L36 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: left end 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GGGCGGCGAC CTCGCGGGTT TTCGCTATTT ATGAAA 36 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 
(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: E.coli 

(B) STRAIN: K12 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: lacZ'533 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CGACGGCCAG TGAATCCGTA ATCATGGTCA TAG 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(v) FRAGMENT TYPE: C-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: E.coli 

(B) STRAIN: K12 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: lacZ333 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: lacZ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
ACCAGCCATC GCCATCTGCT GCACGCGGAA GAA 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: E.coli 

(B) STRAIN: K12 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: lacZ536 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: lacZ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CTATGACCAT GATTACGGAT TCACTGGCCG TCGTTT 36 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bacteriophage lambda 

(B) STRAIN: PaPa 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: MBL002 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GCAAGACTCT ATGAGAAGCA GATAAGCGAT AAG 33 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bacteriophage lambda 

(B) STRAIN: PaPa 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: MBL101 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: 27840 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
ATCATTATTT GATTTCAATT TTGTCCCACT CCC 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bacteriophage lambda 

(B) STRAIN: PaPa 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: MBR001 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: 34576 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GGAGAGAACC ATGTATATGA TCGTTATCTG GGT 33 

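
Several of the lambda primers in this listing are related by reverse complementation: the 33-mers of SEQ ID NO: 24 (clone MBL002) and SEQ ID NO: 26 (clone MBR001) appear to begin with the reverse complements of the 27-mers of SEQ ID NO: 7 (clone MBL) and SEQ ID NO: 8 (clone MBR), respectively. The following is a minimal Python sketch of that comparison; the function name `reverse_complement` and the variable names are illustrative assumptions, and the sequences are transcribed from the entries above.

```python
# Complement mapping for the four DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(sequence: str) -> str:
    """Return the reverse complement of a DNA string, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    return bases.translate(COMPLEMENT)[::-1]


def ungroup(sequence: str) -> str:
    """Remove the 10-base grouping spaces used in the listing."""
    return "".join(sequence.split()).upper()


seq_7 = "GCTTATCTGC TTCTCATAGA GTCTTGC"          # clone MBL
seq_8 = "ATAACGATCA TATACATGGT TCTCTCC"          # clone MBR
seq_24 = "GCAAGACTCT ATGAGAAGCA GATAAGCGAT AAG"  # clone MBL002
seq_26 = "GGAGAGAACC ATGTATATGA TCGTTATCTG GGT"  # clone MBR001

for short, long_ in ((seq_7, seq_24), (seq_8, seq_26)):
    rc = reverse_complement(short)
    print(rc, ungroup(long_).startswith(rc))
```

The same helper can be used to confirm any other pair of primers that is expected to anneal to opposite strands of the same site.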
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bacteriophage lambda 

(B) STRAIN: PaPa 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: MBR202 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: 34793 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GCGCACAAAA CCATAGATTG CTCTTCTGTA AGG 33 
(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(v) FRAGMENT TYPE: C-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: E.coli 
(B) STRAIN: K12 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: MSA1933 

(viii) POSITION IN GENOME: 
(B) MAP POSITION: lacZ 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CCCGGTTATT ATTATTTTTG ACACCAGACC AAC 33 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: bacteriophage lambda 

(B) STRAIN: PaPa 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: R36 

(viii) POSITION IN GENOME: 

(B) MAP POSITION: right end 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
AGGTCGCCGC CCCGTAACCT GTCGGATCAC CGGAAA 